<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Neurol.</journal-id>
<journal-title>Frontiers in Neurology</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Neurol.</abbrev-journal-title>
<issn pub-type="epub">1664-2295</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fneur.2022.951401</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Neurology</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Machine learning approach for hemorrhagic transformation prediction: Capturing predictors&#x00027; interaction</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author" corresp="yes">
<name><surname>Elsaid</surname> <given-names>Ahmed F.</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="corresp" rid="c001"><sup>&#x0002A;</sup></xref>
<xref ref-type="author-notes" rid="fn002"><sup>&#x02020;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/1828739/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Fahmi</surname> <given-names>Rasha M.</given-names></name>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<xref ref-type="author-notes" rid="fn003"><sup>&#x02020;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/2089767/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Shehta</surname> <given-names>Nahed</given-names></name>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
<xref ref-type="author-notes" rid="fn004"><sup>&#x02020;</sup></xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Ramadan</surname> <given-names>Bothina M.</given-names></name>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>Department of Public Health and Community Medicine, Zagazig University</institution>, <addr-line>Zagazig</addr-line>, <country>Egypt</country></aff>
<aff id="aff2"><sup>2</sup><institution>Neurology Department, Faculty of Medicine, Zagazig University</institution>, <addr-line>Zagazig</addr-line>, <country>Egypt</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Robin Lemmens, University Hospitals Leuven, Belgium</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Anna Bonkhoff, Massachusetts General Hospital and Harvard Medical School, United States; Alex Jung, Aalto University, Finland</p></fn>
<corresp id="c001">&#x0002A;Correspondence: Ahmed F. Elsaid <email>afelsaid&#x00040;zu.edu.eg</email>; <email>elsaid_af&#x00040;hotmail.com</email></corresp>
<fn fn-type="other" id="fn001"><p>This article was submitted to Stroke, a section of the journal Frontiers in Neurology</p></fn>
<fn fn-type="other" id="fn002"><p>&#x02020;ORCID: Ahmed F. Elsaid <ext-link ext-link-type="uri" xlink:href="https://orcid.org/0000-0002-4620-3933">orcid.org/0000-0002-4620-3933</ext-link></p></fn>
<fn fn-type="other" id="fn003"><p>Rasha M. Fahmi <ext-link ext-link-type="uri" xlink:href="https://orcid.org/0000-0003-3934-2254">orcid.org/0000-0003-3934-2254</ext-link></p></fn>
<fn fn-type="other" id="fn004"><p>Nahed Shehta <ext-link ext-link-type="uri" xlink:href="https://orcid.org/0000-0001-5121-6117">orcid.org/0000-0001-5121-6117</ext-link></p></fn></author-notes>
<pub-date pub-type="epub">
<day>24</day>
<month>11</month>
<year>2022</year>
</pub-date>
<pub-date pub-type="collection">
<year>2022</year>
</pub-date>
<volume>13</volume>
<elocation-id>951401</elocation-id>
<history>
<date date-type="received">
<day>24</day>
<month>05</month>
<year>2022</year>
</date>
<date date-type="accepted">
<day>28</day>
<month>10</month>
<year>2022</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2022 Elsaid, Fahmi, Shehta and Ramadan.</copyright-statement>
<copyright-year>2022</copyright-year>
<copyright-holder>Elsaid, Fahmi, Shehta and Ramadan</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license></permissions>
<abstract>
<sec>
<title>Background and purpose</title>
<p>Patients with ischemic stroke frequently develop hemorrhagic transformation (HT), which could potentially worsen the prognosis. The objectives of the current study were to determine the incidence and predictors of HT, to evaluate predictor interaction, and to identify the optimal predicting models.</p></sec>
<sec>
<title>Methods</title>
<p>A prospective study included 360 patients with ischemic stroke, of whom 354 successfully continued the study. Patients were subjected to thorough general and neurological examination and T2 diffusion-weighted MRI, at admission and 1 week later to determine the incidence of HT. HT predictors were selected by a filter-based minimum redundancy maximum relevance (mRMR) algorithm independent of model performance. Several machine learning algorithms including multivariable logistic regression classifier (LRC), support vector classifier (SVC), random forest classifier (RFC), gradient boosting classifier (GBC), and multilayer perceptron classifier (MLPC) were optimized for HT prediction in a randomly selected half of the sample (training set) and tested in the other half of the sample (testing set). The model predictive performance was evaluated using receiver operator characteristic (ROC) and visualized by observing case distribution relative to the models&#x00027; predicted three-dimensional (3D) hypothesis spaces within the testing dataset true feature space. The interaction between predictors was investigated using generalized additive modeling (GAM).</p></sec>
<sec>
<title>Results</title>
<p>The incidence of HT in patients with ischemic stroke was 19.8%. Infarction size, cerebral microbleeds (CMB), and the National Institute of Health stroke scale (NIHSS) were identified as the best HT predictors. RFC (AUC: 0.91, 95% CI: 0.85&#x02013;0.95) and GBC (AUC: 0.91, 95% CI: 0.86&#x02013;0.95) demonstrated significantly superior performance compared to LRC (AUC: 0.85, 95% CI: 0.79&#x02013;0.91) and MLPC (AUC: 0.85, 95% CI: 0.78&#x02013;0.92). SVC (AUC: 0.90, 95% CI: 0.85&#x02013;0.94) outperformed LRC and MLPC but did not reach statistical significance. LRC and MLPC did not show significant differences. The best models&#x00027; 3D hypothesis spaces demonstrated non-linear decision boundaries suggesting an interaction between predictor variables. GAM analysis demonstrated a linear and non-linear significant interaction between NIHSS and CMB and between NIHSS and infarction size, respectively.</p></sec>
<sec>
<title>Conclusion</title>
<p>Cerebral microbleeds, NIHSS, and infarction size were identified as HT predictors. The best predicting models were RFC and GBC capable of capturing nonlinear interaction between predictors. Predictor interaction suggests a dynamic, rather than, fixed cutoff risk value for any of these predictors.</p></sec></abstract>
<kwd-group>
<kwd>ischemic stroke</kwd>
<kwd>hemorrhagic transformation</kwd>
<kwd>machine learning</kwd>
<kwd>cerebral microbleeds</kwd>
<kwd>NIHSS</kwd>
<kwd>infarction size</kwd>
</kwd-group>
<counts>
<fig-count count="4"/>
<table-count count="3"/>
<equation-count count="2"/>
<ref-count count="57"/>
<page-count count="14"/>
<word-count count="8592"/>
</counts>
</article-meta>
</front>
<body>
<sec sec-type="intro" id="s1">
<title>Introduction</title>
<p>Patients with ischemic stroke are at risk of developing hemorrhagic transformation (HT), which could be defined as bleeding within the infarcted area. HT could be precipitated spontaneously or secondary to anticoagulant or thrombolytic reperfusion therapy for ischemic stroke. The reported incidence of HT in patients with stroke varied widely from 0.6 to 85% (<xref ref-type="bibr" rid="B1">1</xref>). HT could develop asymptomatically, detected only by CT or MRI, or symptomatically as evident by the associated worsening of existent neurological/clinical manifestation (<xref ref-type="bibr" rid="B2">2</xref>). Reperfusion injury had been proposed as the major pathophysiologic mechanism underlying the development of HT. The incidence of HT in ischemic stroke poses a significant risk of deterioration because extravasation of blood could exaggerate inflammatory reactions and promote the progression of brain damage (<xref ref-type="bibr" rid="B3">3</xref>, <xref ref-type="bibr" rid="B4">4</xref>). Therefore, detecting susceptibility to HT could facilitate designing management plans for patients with high-risk.</p>
<p>The interaction among risk factors, metabolic, and signaling pathways is a determinant factor in the pathogenies of ischemic stroke, HT, and related complications. The effect of variable interaction on the outcome could be defined by the variance of the outcome that cannot be explained by the main effect of independent factors alone (<xref ref-type="bibr" rid="B5">5</xref>). The interaction between predictors is expected when the slope of the relationship between one predictor and the outcome is dependent on (or a function of) another predictor (<xref ref-type="bibr" rid="B6">6</xref>). Therefore, describing the interaction between predictors is important for the proper prediction of the outcome. Machine learning (ML) algorithms became increasingly utilized in stroke diagnosis and outcome prediction (<xref ref-type="bibr" rid="B7">7</xref>). ML could be broadly classified into supervised and unsupervised based on whether the training data have labeled or unlabeled outcomes, respectively. Supervised ML could be utilized for classification, whereas unsupervised ML could be used to identify hidden patterns such as clustering or abnormal anomaly within the data. Among the most commonly used supervised ML classifiers are logistic regression classifier (LRC), support vector classifiers (SVC), random forest classifier (RFC), decision tree-based gradient boosting classifier (GBC), and multilayer perceptron classifier (MLPC). Machine learning algorithms were reported to efficiently analyze complex non-linear interactions between variables and were utilized to develop prediction models in a variety of clinical settings (<xref ref-type="bibr" rid="B8">8</xref>&#x02013;<xref ref-type="bibr" rid="B10">10</xref>).</p>
<p>The objective of the current study was to determine the incidence and predictors of HT, to evaluate predictor interaction, and to identify the best predicting models taking advantage of the state-of-the-art ML algorithms.</p></sec>
<sec id="s2">
<title>Patients and methods</title>
<sec>
<title>Problem formulation</title>
<p>Hemorrhagic transformation prediction was modeled as a supervised learning problem with the incidence of HT as a binary label and selected patient characteristics as features. A total of 16 features including patient clinical data, laboratory findings, and magnetic resonance imaging (MRI) markers were studied. Feature data were collected at patient admission, whereas determining labels was performed using T2 diffusion-weighted MRI conducted at admission and 1 week after.</p></sec>
<sec>
<title>Patients and study design</title>
<p>The current prospective study originally included a total of 360 patients with ischemic stroke of whom six patients died before taking the second MRI and 354 completed the study. Participants were recruited using systemic random technique (every 3rd admission) from stroke and intensive care units (ICU), Zagazig University Hospitals, Egypt from January 2018 to February 2020. Participants were subjected to two T2 diffusion-weighted MRIs, at admission and after 1 week to diagnose HT.</p></sec>
<sec>
<title>Inclusion criteria</title>
<p>Any patients with ischemic stroke aged &#x02265;18 years old.</p></sec>
<sec>
<title>Exclusion criteria</title>
<p>Any patients who received rtPA or suffered from subarachnoid hemorrhage, intracerebral bleeding or any head injuries, hematological disorders, serious liver or renal impairment, coexistent brain infection, tumor, or congenital malformations.</p></sec>
<sec>
<title>Clinical and laboratory features</title>
<p>Thorough general and neurological examinations, including NIHSS, were performed. Clinical risk factors were defined as follows: hypertension (receiving medications for hypertension or blood pressure &#x0003E;140/90 mmHg on repeated measurements), diabetes mellitus (receiving medications for diabetes mellitus, fasting blood sugar &#x02265;126 mg/dL or HbA1c &#x02265;6.5%, or a casual plasma glucose &#x0003E;200 mg/dL), hypercholesterolemia [receiving cholesterol-reducing agents or an overnight fasting cholesterol level &#x02265;200 mg/dL, triglycerides &#x02265;200 mg/dL, or low-density lipoprotein (LDL) cholesterol &#x02265;160 mg/dL], in addition to a history of previous heart disease. Performed laboratory tests included blood glucose level, complete blood count (CBC), platelet count, partial thromboplastin time, liver function tests, renal function tests, lipid profile, and erythrocyte sedimentation rate (ESR).</p></sec>
<sec>
<title>MRI variables/markers and imaging protocol</title>
<p>Conventional MR, diffusion-weighted image, and GRET2WI (T2<sup>&#x0002A;</sup>WI) were performed using a 1.5 T MR Scanner (Achieva, Philips Medical System). Images were obtained with the patient in the supine position, and a scout sagittal T1-weighted image was used as a localizer; then, multiple pulse sequences were performed to obtain axial, coronal, and sagittal images. The conventional MR sequences included the following: (1) sagittalT1-weighted image as a localizer (TE 8/TR 500 ms), (2) axial T1W-weighted image (TR148-597/TE2-15 ms), (3) axial and sagittal T2-weighted images (TR4400- 4800/TE110 ms), and (4) axial FLAIR (TR6000/TE120-TI2000). Section thickness was set at 5 mm and a gap of 1 mm. We used a field of view equal to 30 mm in coronal images and 240 mm in axial images. T2<sup>&#x0002A;</sup>WI parameters included (TR/TE 641/23 ms) a flip angle of 20, matrix 512 &#x000D7; 512, with a field of view of 250 mm.</p>
<p>For diffusion-weighted imaging, we used a multisection single-shot spin echo planner imaging sequence (TR/TE/NEX: 4.200/140 ms/I) with diffusion sensitivities b-values set at 0 and 1,000 s/mm<sup>2</sup> and total acquisition time of 80 s. The reconstructed images were transferred to the workstation for the calculation of apparent diffusion coefficient values (<xref ref-type="bibr" rid="B11">11</xref>). Analysis was performed by a radiologist ignorant of the study using the software DICOM viewer (v3.0 /v7.2.0.1; Philips Medical Systems Nederland B.V, Best, Netherlands).</p>
<p>Cerebral microbleeds (CMBs) were defined as black (signal loss) or hypointense rounded areas outside the infarcted areas and with &#x0003C;10 mm in diameter on T2<sup>&#x0002A;</sup>Wl (<xref ref-type="bibr" rid="B12">12</xref>). The rating of CMBs was performed using the validated Brain Observer Microbleed Scale (<xref ref-type="bibr" rid="B13">13</xref>). Superficial siderosis was defined as gyriform hypointensity without corresponding hyperintense signal on T1-weighted sequences or FLAIR (<xref ref-type="bibr" rid="B14">14</xref>). Infarction size was determined by the largest diameter of the lesion (<xref ref-type="bibr" rid="B15">15</xref>).</p>
<p>The location of ischemic stroke was determined according to Bamford classification as partial anterior circulation (PAC), total anterior circulation (TAC), lacunar, and posterior circulation (POCS) (<xref ref-type="bibr" rid="B16">16</xref>). Etiological classification was performed according to the TOAST criteria (Trial of ORG 10172 in Acute Stroke Treatment) into small, large, cardiac, and undermined (<xref ref-type="bibr" rid="B17">17</xref>).</p></sec>
<sec>
<title>Label</title>
<p>The incidence of HT was defined as any degree of hyperdensity within the area of low attenuation (<xref ref-type="bibr" rid="B18">18</xref>). The type of HT was determined according to ECASS II classification (European Cooperative Acute Stroke Study) into hemorrhagic infarction (HI) and parenchymal hematoma (PH) (<xref ref-type="bibr" rid="B19">19</xref>). HI was further classified into petechiae affecting the infarction margin (HI1) and confluent petechiae with no mass effect within the infarct area (HI2). PH was subdivided into PH1 affecting &#x0003C;30% of the infarct area and PH2 affecting more than 30% of the infarct area with mass effect (<xref ref-type="bibr" rid="B19">19</xref>).</p></sec>
<sec>
<title>Ethical conduct and institutional approval</title>
<p>Informed consent was obtained from all participants or their relatives. Ethical approval was obtained from the local ethics committee of our hospital.</p></sec>
<sec>
<title>Statistical analysis</title>
<sec>
<title>Descriptive analysis and feature selection</title>
<p>Clinical, neurological, laboratory, and imaging data were introduced at the initiation (admission) and 7 days later. Continuous and categorical variables were presented as mean &#x000B1; SD and percentages and were compared using <italic>t</italic>-test and chi-square, respectively. The 354 samples were randomly split (the random seed number was set to 64) into two datasets (177 subjects each) to be used as training and testing datasets. Univariate analysis was performed in the training dataset to describe the frequency distribution of variables among HT positive and negative cases. Variable selection is an important step in model building to produce a parsimonious model with reduced data dimensionality, enhanced interpretability, reduced overfitting, and enhanced generalizability (<xref ref-type="bibr" rid="B20">20</xref>, <xref ref-type="bibr" rid="B21">21</xref>). We performed variable selection using a filter-based technique, using the minimum redundancy maximum relevance (mRMRe) algorithm, which is provided by the varrank package in R (<xref ref-type="bibr" rid="B22">22</xref>, <xref ref-type="bibr" rid="B23">23</xref>).</p>
<p>The mRMRe algorithm selects variables based solely on the characteristics of input data, independent of any model developing algorithm, and thus is robust to overfitting. In contrast to linear correlation, mRMRe algorithms could measure non-linear relationships and remain invariant under variable inevitable transformations (<xref ref-type="bibr" rid="B24">24</xref>).</p></sec>
<sec>
<title>Model development, optimization, and comparison</title>
<p>Logistic regression classifier, SVC, RFC, GBC, and MLPC models were developed using the identified predictors, optimized in the training dataset, and used to predict HT incidence in the testing dataset. Grid search was utilized for hyperparameter tuning for GBC, SVC, MLPC, and LRC, whereas automated Bayesian hyperparameter tuning was utilized for RFC. Models were optimized <italic>via</italic> minimizing loss function to reduce misclassification errors in the training dataset (<xref ref-type="bibr" rid="B25">25</xref>). SKlearn built-in log-loss function was used for training GBC, LRC, and MLPC, the squared hinge function was used for training SVC, whereas the Gini function was used for RFC. The log-loss function is defined as <italic>L</italic><sub>log</sub>(<italic>y, p</italic>) &#x0003D; &#x02212;(<italic>ylog</italic>(<italic>p</italic>) &#x0002B; (1&#x02212;<italic>y</italic>)log(1&#x02212;<italic>p</italic>)), where <italic>L</italic><sub>log</sub> is the log loss, <italic>y</italic> is the true label, and <italic>p</italic> is the probability estimate that <italic>y</italic> &#x0003D; 1. The log-loss function will be reduced to &#x02212;log(1&#x02212;<italic>p</italic>)<italic>if y</italic> &#x0003D; 0, and &#x02212;log(<italic>p</italic>)<italic>if y</italic> &#x0003D; 1. The squared hinge loss is defined as</p>
<disp-formula id="E1"><mml:math id="M1"><mml:mi>L</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mtext>y</mml:mtext><mml:mo>,</mml:mo><mml:mtext>&#x000A0;</mml:mtext><mml:mover accent='true'><mml:mtext>y</mml:mtext><mml:mo stretchy='true'>&#x0005E;</mml:mo></mml:mover></mml:mrow><mml:mo>)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mtext>&#x000A0;</mml:mtext><mml:mstyle displaystyle='true'><mml:msubsup><mml:mo>&#x02211;</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>0</mml:mn></mml:mrow><mml:mtext>N</mml:mtext></mml:msubsup><mml:mrow><mml:mtext>&#x000A0;</mml:mtext><mml:mo stretchy='false'>(</mml:mo><mml:mi>max</mml:mi><mml:msup><mml:mrow><mml:mo stretchy='false'>(</mml:mo><mml:mn>0</mml:mn><mml:mo>,</mml:mo><mml:mtext>&#x000A0;</mml:mtext><mml:mn>1</mml:mn><mml:mo>&#x02212;</mml:mo><mml:mtext>&#x000A0;</mml:mtext><mml:msub><mml:mtext>y</mml:mtext><mml:mrow><mml:mtext>i&#x000A0;</mml:mtext></mml:mrow></mml:msub><mml:mo>&#x000B7;</mml:mo><mml:mtext>&#x000A0;</mml:mtext><mml:mover accent='true'><mml:mtext>y</mml:mtext><mml:mo stretchy='true'>&#x0005E;</mml:mo></mml:mover><mml:mo stretchy='false'>)</mml:mo></mml:mrow><mml:mn>2</mml:mn></mml:msup><mml:mo>,</mml:mo></mml:mrow></mml:mstyle></mml:math></disp-formula>
<p>where y and <inline-formula><mml:math id="M2"><mml:mover accent='true'><mml:mtext>y</mml:mtext><mml:mo stretchy='true'>&#x0005E;</mml:mo></mml:mover></mml:math></inline-formula> are the true and predicted labels, respectively.</p>
<p>Random forest classifier was optimized using automated Bayesian hyperparameter tuning provided by the open-source hyperopt Python library in the Python environment (<xref ref-type="bibr" rid="B26">26</xref>). Hyperopt utilizes a Tree Parzen Estimator (TPE) algorithm, which is Bayesian optimization algorithm that instead of modeling <italic>p</italic>(<italic>y</italic>|<italic>x</italic>) directly, it models <italic>p</italic>(<italic>x</italic>|<italic>y</italic>) and <italic>p</italic>(<italic>y</italic>), such that</p>
<disp-formula id="E2"><mml:math id="M3"><mml:mi>p</mml:mi><mml:mrow><mml:mo>(</mml:mo><mml:mrow><mml:mi>x</mml:mi><mml:mo>&#x0007C;</mml:mo><mml:mi>y</mml:mi></mml:mrow><mml:mo>)</mml:mo></mml:mrow><mml:mo>=</mml:mo><mml:mrow><mml:mo stretchy='true'>{</mml:mo><mml:mrow><mml:mtable><mml:mtr><mml:mtd><mml:mrow><mml:mi>&#x02113;</mml:mi><mml:mi>x</mml:mi><mml:mtext>&#x000A0;</mml:mtext><mml:mi>i</mml:mi><mml:mi>f</mml:mi><mml:mtext>&#x000A0;&#x000A0;</mml:mtext></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mi>y</mml:mi><mml:mo>&#x0003C;</mml:mo><mml:msup><mml:mi>y</mml:mi><mml:mo>&#x02217;</mml:mo></mml:msup></mml:mrow></mml:mtd></mml:mtr><mml:mtr><mml:mtd><mml:mrow><mml:mtext>&#x000A0;g</mml:mtext><mml:mi>x</mml:mi><mml:mtext>&#x000A0;</mml:mtext><mml:mi>i</mml:mi><mml:mi>f</mml:mi><mml:mtext>&#x000A0;&#x000A0;</mml:mtext></mml:mrow></mml:mtd><mml:mtd><mml:mrow><mml:mi>y</mml:mi><mml:mo>&#x02265;</mml:mo><mml:msup><mml:mi>y</mml:mi><mml:mo>&#x02217;</mml:mo></mml:msup><mml:mo>,</mml:mo></mml:mrow></mml:mtd></mml:mtr></mml:mtable></mml:mrow></mml:mrow></mml:math></disp-formula>
<p>where <italic>l</italic>(<italic>x</italic>) is the density formed by using the observations {<italic>x</italic><sup>(<italic>i</italic>)</sup>} such that the corresponding loss <italic>f</italic>(<italic>x</italic><sup>(<italic>i</italic>)</sup>) is less than <italic>y</italic><sup>&#x0002A;</sup> and g(<italic>x</italic>) is the density formed by using the remaining observations (<xref ref-type="bibr" rid="B27">27</xref>). <xref ref-type="supplementary-material" rid="SM3">Supplementary Table 1</xref> lists the tuned hyperparameters of different models.</p>
<p>The area under the ROC curve (AUC) was used for the evaluation and comparison between models using Delong&#x00027;s non-parametric method (<xref ref-type="bibr" rid="B28">28</xref>, <xref ref-type="bibr" rid="B29">29</xref>). AUC is equivalent to the probability of a classifier to correctly discriminating and ranking positive instances higher than negative instances in a randomly chosen sample, which renders it equivalent to the Wilcoxon test of ranks (<xref ref-type="bibr" rid="B28">28</xref>). AUC is a prevalence- and threshold-independent metric that was reported to be more robust to data imbalance than accuracy (<xref ref-type="bibr" rid="B30">30</xref>). To guard against overfitting, the Youden index identified in the training dataset was used as the classification threshold for ROC analysis in the testing dataset. The model predictive performance metrics were calculated as follows: sensitivity (TP/TP &#x0002B; FN), specificity (TN/TN &#x0002B; FP), predictive positive value (TP/TP &#x0002B; FP), and predictive negative value (TN/TN &#x0002B; FN).</p>
<p>To better understand the relationship between predictor variables and HT incidence, the model 3D hypothesis spaces within the true feature space were plotted. The distribution of HT positive and negative cases relative to the 3D hypothesis space was examined. The interaction of predictors was investigated by fitting a series of the generalized additive model (GAM) with interaction terms, smooth plate regression splines, and tensor product splines using the generalized additive model provided by the mgcv package in the R environment (<xref ref-type="bibr" rid="B23">23</xref>, <xref ref-type="bibr" rid="B31">31</xref>). The Python scikit-learn library, version 0.23.2, was utilized to train and test machine learning models using Jupyter Notebook (<xref ref-type="bibr" rid="B32">32</xref>, <xref ref-type="bibr" rid="B33">33</xref>). Predicted hypothesis spaces were produced using plotly functions within the Python environment, version 4.10 (<xref ref-type="bibr" rid="B34">34</xref>).</p></sec>
<sec>
<title>Sample size</title>
<p>The incidence of HT in patients with ischemic stroke was reported to be around 8.7&#x02013;12.3% (<xref ref-type="bibr" rid="B18">18</xref>, <xref ref-type="bibr" rid="B35">35</xref>). A sample of 126 was estimated to detect the incidence of 9 &#x000B1; 5% precision with a 95% confidence level and power of 80%. Because we intended to split the sample into training and testing datasets and to compensate for any lost cases, we enrolled 360 participants, of whom 354 continued the study.</p></sec></sec></sec>
<sec sec-type="results" id="s3">
<title>Results</title>
<sec>
<title>Patient characteristics, HT incidence, and predictors</title>
<p>Fifty-seven patients were excluded from participation. The main causes were refusal to participate (11), received rtPA (14), head injuries (6), and hematological/liver disorders (26). The clinical and laboratory characteristics of the 354 patients with ischemic stroke who completed the study are presented in <xref ref-type="table" rid="T1">Table 1</xref>. There was no significant difference between the training and testing datasets except for the NIHSS and PTT. The training dataset demonstrated lower average NIHSS scores and higher average PTT levels compared to the testing dataset. This difference did not influence variable selection because NIHSS, which was lower in the training dataset, was selected and not the higher level PTT. There was no significant difference between the training and testing datasets regarding stroke etiology and location. The percentages of small vessels, large vessels, cardiac, and undermined lesions were 27.1, 33.3, 30.5, and 9.0% in the training dataset and 24.3, 37.3, 28.2, and 10.2% in the testing dataset, respectively. The percentages of lesion PAC, TAC, lacunar, and POCS were 36.7, 18.1, 25.4, and 19.8% in the training dataset and 41.8, 16.9, 24.9, and 16.4% in the testing dataset, respectively. The average duration between stroke onset and ICU presentation was 7.8 (&#x000B1;2.11) in the training dataset, which was not significantly different from that in the testing dataset 7.5 (&#x000B1;2.05).</p>
<table-wrap position="float" id="T1">
<label>Table 1</label>
<caption><p>Baseline characteristics of patients with ischemic stroke in the total sample and the randomly divided subgroups, the training and testing datasets.</p></caption>
<table frame="hsides" rules="groups">
<thead><tr>
<th valign="top" align="left"><bold>Characteristic</bold></th>
<th/>
<th valign="top" align="center"><bold>Total sample</bold><break/> <bold>(<italic>N</italic> = 354)</bold></th>
<th valign="top" align="center"><bold>Training dataset</bold><break/> <bold>(<italic>N</italic> = 177)</bold></th>
<th valign="top" align="center"><bold>Testing dataset</bold><break/> <bold>(<italic>N</italic> = 177)</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Age</td>
<td/>
<td valign="top" align="center">62.8 (&#x000B1;10.5)</td>
<td valign="top" align="center">62 (&#x000B1;11)</td>
<td valign="top" align="center">63 (&#x000B1;10)</td>
</tr>
<tr>
<td valign="top" align="left">Gender (male)</td>
<td/>
<td valign="top" align="center">199 (56.2%)</td>
<td valign="top" align="center">98 (55.4%)</td>
<td valign="top" align="center">101 (57.1%)</td>
</tr>
<tr>
<td valign="top" align="left">Hypertension</td>
<td/>
<td valign="top" align="center">201 (56.8%)</td>
<td valign="top" align="center">102(57.6%)</td>
<td valign="top" align="center">99 (55.9%)</td>
</tr>
<tr>
<td valign="top" align="left">Diabetes mellitus</td>
<td/>
<td valign="top" align="center">134 (37.9%)</td>
<td valign="top" align="center">72 (40.7%)</td>
<td valign="top" align="center">61 (34.5%)</td>
</tr>
<tr>
<td valign="top" align="left">Smoking</td>
<td/>
<td valign="top" align="center">70 (19.8%)</td>
<td valign="top" align="center">33 (18.6%)</td>
<td valign="top" align="center">37 (20.9%)</td>
</tr>
<tr>
<td valign="top" align="left">Dyslipidemia</td>
<td/>
<td valign="top" align="center">135 (38.1%)</td>
<td valign="top" align="center">70 (39.5%)</td>
<td valign="top" align="center">65 (36.7%)</td>
</tr>
<tr>
<td valign="top" align="left">IHD</td>
<td/>
<td valign="top" align="center">113 (31.9%)</td>
<td valign="top" align="center">62 (35.0%)</td>
<td valign="top" align="center">51 (28.8%)</td>
</tr>
<tr>
<td valign="top" align="left">Platelets</td>
<td/>
<td valign="top" align="center">242.89 (&#x000B1;68.9)</td>
<td valign="top" align="center">241.8 (&#x000B1;71.2)</td>
<td valign="top" align="center">243.9 (&#x000B1;66.6)</td>
</tr>
<tr>
<td valign="top" align="left">PTT</td>
<td/>
<td valign="top" align="center">30.6(&#x000B1;5.5)</td>
<td valign="top" align="center">31.3 (&#x000B1;6.1)<xref ref-type="table-fn" rid="TN1"><sup>&#x0002A;</sup></xref></td>
<td valign="top" align="center">29.9 (&#x000B1;4.6)</td>
</tr>
<tr>
<td valign="top" align="left">Creatinine</td>
<td/>
<td valign="top" align="center">0.73 (&#x000B1;0.2)</td>
<td valign="top" align="center">0.75(&#x000B1;0.2)</td>
<td valign="top" align="center">0.72 (&#x000B1;0.2)</td>
</tr>
<tr>
<td valign="top" align="left">Previous stroke/TIA</td>
<td/>
<td valign="top" align="center">49 (13.8%)</td>
<td valign="top" align="center">23 (13.0%)</td>
<td valign="top" align="center">26 (14.7%)</td>
</tr>
<tr>
<td valign="top" align="left">Antiplatelet</td>
<td/>
<td valign="top" align="center">97 (27.4%)</td>
<td valign="top" align="center">55 (31.1%)</td>
<td valign="top" align="center">38 (21.5%)</td>
</tr>
<tr>
<td valign="top" align="left">Infarction size</td>
<td/>
<td valign="top" align="center">2.47 (&#x000B1;1.8)</td>
<td valign="top" align="center">2.43(&#x000B1;1.8)</td>
<td valign="top" align="center">2.5 (1.9)</td>
</tr>
<tr>
<td valign="top" align="left">NIHSS</td>
<td/>
<td valign="top" align="center">13.16 (&#x000B1;6.1)</td>
<td valign="top" align="center">11.7 (&#x000B1;5.4)</td>
<td valign="top" align="center">13.6 (&#x000B1;6.2)<xref ref-type="table-fn" rid="TN2"><sup>&#x000B1;</sup></xref></td>
</tr>
<tr>
<td valign="top" align="left">CMB</td>
<td/>
<td valign="top" align="center">2.97 (&#x000B1;4.7)</td>
<td valign="top" align="center">3.2(&#x000B1;4.9)</td>
<td valign="top" align="center">2.8 (&#x000B1;4.5)</td>
</tr>
<tr>
<td valign="top" align="left">Superficial siderosis</td>
<td/>
<td valign="top" align="center">54 (15.3%)</td>
<td valign="top" align="center">33 (18.6%)</td>
<td valign="top" align="center">21 (11.9%)</td>
</tr>
<tr>
<td valign="top" align="left">Duration form symptoms to presentation (hours)</td>
<td/>
<td valign="top" align="center">7.7 (2.08)</td>
<td valign="top" align="center">7.8 (2.11)</td>
<td valign="top" align="center">7.5 (2.05)</td>
</tr>
<tr>
<td valign="top" align="left">TOAST</td>
<td valign="top" align="left">Small</td>
<td valign="top" align="center">91 (25.7%)</td>
<td valign="top" align="center">48 (27.1%)</td>
<td valign="top" align="center">43 (24.3%)</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Large</td>
<td valign="top" align="center">125 (35.3%)</td>
<td valign="top" align="center">59 (33.3%)</td>
<td valign="top" align="center">66(37.3%)</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Cardiac</td>
<td valign="top" align="center">104 (29.4%)</td>
<td valign="top" align="center">54 (30.5%)</td>
<td valign="top" align="center">50 (28.2%)</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Undetermined</td>
<td valign="top" align="center">34 (9.6%)</td>
<td valign="top" align="center">16 (9.0%)</td>
<td valign="top" align="center">18 (10.2%)</td>
</tr>
<tr>
<td valign="top" align="left">Location</td>
<td valign="top" align="left">PACS</td>
<td valign="top" align="center">139 (39.3%)</td>
<td valign="top" align="center">65 (36.7%)</td>
<td valign="top" align="center">74 (41.8%)</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">TACS</td>
<td valign="top" align="center">62 (17.5%)</td>
<td valign="top" align="center">32 (18.1%)</td>
<td valign="top" align="center">30 (16.9%)</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Lacunar</td>
<td valign="top" align="center">89 (25.1%)</td>
<td valign="top" align="center">45 (25.4%)</td>
<td valign="top" align="center">44 (24.9%)</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">POCS</td>
<td valign="top" align="center">64 (18.1%)</td>
<td valign="top" align="center">35 (19.8%)</td>
<td valign="top" align="center">29 (16.4%)</td>
</tr>
<tr>
<td valign="top" align="left">HT incidence</td>
<td/>
<td valign="top" align="center">70 (19.8%)</td>
<td valign="top" align="center">40 (22.6%)</td>
<td valign="top" align="center">30 (16.9%)</td>
</tr>
<tr>
<td valign="top" align="left">ECASS II</td>
<td valign="top" align="left">HI1</td>
<td valign="top" align="center">20 (5.6%)</td>
<td valign="top" align="center">13 (7.3%)</td>
<td valign="top" align="center">7 (4.0%)</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">HI2</td>
<td valign="top" align="center">27 (7.6%)</td>
<td valign="top" align="center">15 (8.5%)</td>
<td valign="top" align="center">12 (6.8%)</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">PH1</td>
<td valign="top" align="center">15 (4.2%)</td>
<td valign="top" align="center">8 (4.5%)</td>
<td valign="top" align="center">7 (4.0%)</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">PH2</td>
<td valign="top" align="center">8 (2.3%)</td>
<td valign="top" align="center">4 (2.3%)</td>
<td valign="top" align="center">4 (2.3%)</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="TN1"><label>&#x0002A;</label><p>Significantly larger than that of testing dataset.</p></fn>
<fn id="TN2"><label>&#x000B1;</label><p>Significantly larger than that of training dataset.</p></fn>
<p>Table numbers represent frequencies (%) or means (&#x000B1; SD), which were compared using chi-square and two-sided <italic>t</italic>-test, respectively. The training and testing datasets were not significantly different except for PTT and NIHSS. This difference did not influence variable selection from the training dataset as evidenced by mRMR selection of the lower NIHSS and not the higher level PTT as the informative predictor of HT incidence.</p>
<p>IHD, ischemic heart diseases; PTT, partial thromboplastin time; NIHSS, National Institutes of Health Stroke Scale; CMB, cerebral microbleeds; TOAST, Trial of ORG 10172 in Acute Stroke Treatment; PACA, partial anterior circulation syndrome; TACS, total anterior circulation syndrome; POCS, posterior circulation syndrome; ECASS, European Cooperative Acute Stroke Study for hemorrhagic transformation classification; HI1, hemorrhagic infarction 1; HI2, hemorrhagic infarction 2; PH1, parenchymal hematoma 1; PH2, parenchymal hematoma 2.</p>
</table-wrap-foot>
</table-wrap>
<p>The incidence of HT in our ischemic stroke cohort was 19.8% (70 out of 354 patients). No significant difference between the training and testing datasets was observed regarding HT type. The percentages of HI1, HI2, PH1, and PH2 were 7.3, 8.5, 4.5, and 2.3% in the training dataset and 4.0, 6.8, 4.0, and 2.3% in the testing dataset, respectively.</p>
<p>Using the mRMR variable selection algorithm in the training dataset, it was possible to identify CMB, NIHSS, and infarction size as the most informative variables (<xref ref-type="fig" rid="F1">Figure 1</xref>), and they were used to build HT prediction models. Univariate analysis between HT positive and negative cases in the training dataset confirmed the results obtained by mRMR (<xref ref-type="supplementary-material" rid="SM4">Supplementary Table 2</xref>). The only significant difference was observed for CMB, NIHSS, and infarction size.</p>
<fig id="F1" position="float">
<label>Figure 1</label>
<caption><p>Score matrix of all studied variables assessed by the mRMRe (minimum redundancy maximum relevance) algorithm provided by the varrank R package. The first column represents the relevance scores of different variables assessed by the mutual information algorithm relative to the HT incidence and ranked in a descending manner. Subsequent columns represent the difference between the relevance and redundancy scores of each variable after adding it to the previously selected variable. Positive scores indicate higher relevance than redundancy scores and were color-coded by a scale from yellow to red, whereas negative scores indicate higher redundancy than relevance scores and were color-coded by a scale from aqua to deep blue. Zero scores were colored green.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fneur-13-951401-g0001.tif"/>
</fig></sec>
<sec>
<title>Comparison of HT predicting models</title>
<p>The learning curves of different models are presented in <xref ref-type="supplementary-material" rid="SM1">Supplementary Figure 1</xref>. Learning curves present mean accuracy scores (&#x000B1;1 SD) as a function of the sample size used for cross-validation. The learning curve of GBC exhibited an overfitting pattern to the training dataset. However, the performance of GBC in the testing dataset was satisfactory. For RFC and SVC, the learning curves were not significantly lower in the testing dataset compared to the training dataset. For sample sizes &#x0003E;80, RFC and SVC did not exhibit major signs of bias or overfitting as suggested by the difference between the training and testing datasets. The learning curves of LRC and MLPC seem to get closer to accuracies lower than RFC, GBC, and SVC. Also, LRC and MLPC curves exhibited higher variability as suggested by the apparently larger &#x000B1;SD intervals and crossing of training and testing curves, which may explain their low AUC metrics.</p>
<p><xref ref-type="table" rid="T2">Table 2</xref> summarizes the performance metrics in the testing dataset. RFC (AUC: 0.91, 95% confidence interval (95% CI): 0.85&#x02013;0.95) and GBC (AUC: 0.91, 95% CI: 0.86&#x02013;0.95) demonstrated significantly superior performance compared to LRC (AUC: 0.85, 95% CI: 0.79&#x02013;0.91) and MLPC (AUC: 0.85, 95% CI: 0.78&#x02013;0.92). Although SVC (AUC: 0.90, 95% CI: 0.85&#x02013;0.94) outperformed LRC and MLPC, it did not reach statistical significance. LRC and MLPC did not show significant differences. <xref ref-type="fig" rid="F2">Figure 2</xref> demonstrates the overall performance of different models in the training and testing datasets as represented by ROC curves. Using classification thresholds determined by the Youden indices identified in the training dataset, the observed classification accuracy in the testing dataset was 0.78 for FRC, 0.76 for GBC, 0.80 for SVC, 0.80 for LRC, and 0.84 for MLPC.</p>
<table-wrap position="float" id="T2">
<label>Table 2</label>
<caption><p>Predictive performance metrics of different models in the testing dataset.</p></caption>
<table frame="hsides" rules="groups">
<thead><tr>
<th valign="top" align="left"><bold>Model</bold></th>
<th valign="top" align="center"><bold>Sensitivity</bold></th>
<th valign="top" align="center"><bold>Specificity</bold></th>
<th valign="top" align="center"><bold>PPV</bold></th>
<th valign="top" align="center"><bold>NPV</bold></th>
<th valign="top" align="center"><bold>AUC (95%CI)<xref ref-type="table-fn" rid="TN3"><sup>&#x00023;</sup></xref></bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">RFC</td>
<td valign="top" align="center">0.97</td>
<td valign="top" align="center">0.74</td>
<td valign="top" align="center">0.43</td>
<td valign="top" align="center">0.99</td>
<td valign="top" align="center">0.91 (0.85&#x02013;0.95)<xref ref-type="table-fn" rid="TN4"><sup>&#x0002A;</sup></xref></td>
</tr>
<tr>
<td valign="top" align="left">GBC</td>
<td valign="top" align="center">0.97</td>
<td valign="top" align="center">0.72</td>
<td valign="top" align="center">0.41</td>
<td valign="top" align="center">0.99</td>
<td valign="top" align="center">0.91 (0.86&#x02013;0.95)<xref ref-type="table-fn" rid="TN5"><sup>&#x000A7;</sup></xref></td>
</tr>
<tr>
<td valign="top" align="left">SVC</td>
<td valign="top" align="center">0.83</td>
<td valign="top" align="center">0.79</td>
<td valign="top" align="center">0.45</td>
<td valign="top" align="center">0.96</td>
<td valign="top" align="center">0.90 (0.85&#x02013;0.94)<xref ref-type="table-fn" rid="TN6"><sup>&#x000A5;</sup></xref></td>
</tr>
<tr>
<td valign="top" align="left">LRC</td>
<td valign="top" align="center">0.70</td>
<td valign="top" align="center">0.82</td>
<td valign="top" align="center">0.45</td>
<td valign="top" align="center">0.93</td>
<td valign="top" align="center">0.85 (0.79&#x02013;0.91)<xref ref-type="table-fn" rid="TN7"><sup>&#x0002B;</sup></xref></td>
</tr>
<tr>
<td valign="top" align="left">MLPC</td>
<td valign="top" align="center">0.57</td>
<td valign="top" align="center">0.89</td>
<td valign="top" align="center">0.52</td>
<td valign="top" align="center">0.91</td>
<td valign="top" align="center">0.85 (0.78&#x02013;0.92)<xref ref-type="table-fn" rid="TN7"><sup>&#x0002B;</sup></xref></td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="TN3"><label>&#x00023;</label><p>AUC was the metric used for the evaluation and comparison of models performance because it is less sensitive to data imbalance.</p></fn>
<fn id="TN4"><label>&#x0002A;</label><p>Significantly higher than LRC (<italic>p</italic>-value = 0.021) and MLPC (<italic>p</italic>-value = 0.007) but not GBC (<italic>p</italic>-value = 0.786) and SVC (<italic>p</italic>-value = 0.286).</p></fn>
<fn id="TN5"><label>&#x000A7;</label><p>Significantly higher than LRC (<italic>p</italic>-value = 0.021) and MLPC (<italic>p</italic>-value = 0.012) but not SVC (<italic>p</italic>-value = 0.270).</p></fn>
<fn id="TN6"><label>&#x000A5;</label><p>Not significantly higher than LRC (<italic>p</italic>-value = 0.054) and MLPC (<italic>p</italic>-value = 0.056).</p></fn>
<fn id="TN7"><label>&#x0002B;</label><p>Was not significantly different from each other (<italic>p</italic>-value = 1).</p></fn>
<p>The MLPC demonstrated very low sensitivity and consequently limited clinical utility and was not considered for 3D model prediction analysis. The classification threshold could be adjusted to modify the misclassification rates (false positive and negative rates) and performance metrics according to the clinical need.</p>
<p>SVC, support vector classifier; GBC, gradient boosting classifier; LRC, logistic regression classifier; RFC, random forest classifier; MLPC, multilayer perceptron classifier; PPV, positive predictive value; NPV, negative predictive value.</p>
</table-wrap-foot>
</table-wrap>
<fig id="F2" position="float">
<label>Figure 2</label>
<caption><p>Comparison of the utilized machine learning models&#x00027; overall performance using the AUC &#x000B1; 95% CI metric. Youden indices were estimated using the maximum sensitivity plus specificity. The RFC and GBC models demonstrated significantly larger AUC compared to LRC and MLPC but with no statistical difference between each other and SVC. AUC, area under curve; CI, confidence interval; SVC, support vector classifier; GBC, gradient boosting classifier; LRC, logistic regression classifier; RFC, random forest classifier; MLPC, multilayer perceptron classifier.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fneur-13-951401-g0002.tif"/>
</fig>
<p>To examine whether adding more predictors of HT incidence could improve model performance, we compared the AUC of models developed by all the studied 16 variables to models developed by the mRMR selected three variables. Except for the three-variable SVC model (AUC: 0.90, 95% CI: 0.85&#x02013;0.94), which significantly surpassed that of the 16-variable model (AUC: 0.82, 95% CI: 0.73&#x02013;0.90), the AUC of the 16-variable RFC (0.91, 95% CI: 0.87&#x02013;0.96), GBC (0.91, 95% CI: 0.86&#x02013;0.95), LRC (0.84, 95% CI: 0.77&#x02013;0.91), and MLPC (0.85, 95% CI: 0.78&#x02013;0.92) were not significantly different from those of the three-variable models (<xref ref-type="supplementary-material" rid="SM2">Supplementary Figure 2</xref>).</p></sec>
<sec>
<title>The best performing models 3D predicted spaces suggest the interaction between predictors</title>
<p><xref ref-type="fig" rid="F3">Figure 3</xref> shows the 3D predictive space of different models relative to the true feature space. The prediction space of the superior models, RFC, and GBC clearly demonstrated a non-linear relationship and plausible interaction between the predictor variables. In contrast, the lower performance, LRC, and 3D predicted space demonstrated a linear relationship between predictors. This result suggested that failure to detect a non-linear relationship between variables was associated with lower model performance.</p>
<fig id="F3" position="float">
<label>Figure 3</label>
<caption><p>3D figure shows the predicted spaces of each ML model within the true feature space. The green area represents the positive HT prediction, while the non-colored area represents the negative prediction. The blue and red dots represent the observed positive and negative HT cases, respectively (the points inside the green predicted space are not visible). The blue dots within the green and clear areas represent true positive and false negative predictions, respectively. The red dots within the green and clear areas represent false positive and true negative predictions, respectively. The best performing models, RFC, GBC, and SVC reveal non-linear decision boundaries indicative of the interaction between the three predictors. At a particular value of NIHSS, observe the reduction of infarction size needed for HT prediction to be green (positive) as the number of CMB increases. Similarly, at a particular value of CMB, observe the reduction of infarction size needed for HT prediction to be green as the NIHSS score increases. LRC did not capture the non-linear relationships and as such, it failed to model the interaction between predictors. The MLPC was not considered for 3D model presentation because of its very low sensitivity. SVC, support vector classifier; GBC, gradient boosting classifier; LRC, logistic regression classifier; RFC, random forest classifier; MLPC, multilayer perceptron classifier.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fneur-13-951401-g0003.tif"/>
</fig></sec>
<sec>
<title>Non-linear interaction of NIHSS with CMB and infarction size</title>
<p>To further investigate the suggested non-linear interaction between HT predictors, we fitted several generalized additive models, all with logit link function, with interaction terms, smooth plate regression splines, and tensor products to the whole dataset. <xref ref-type="table" rid="T3">Table 3</xref> demonstrates that adding interaction terms significantly (&#x003C7;<sup>2</sup> = &#x0003C; 0.001) improved the performance and resulted in 21 unit reduction of the Akaike information criterion (AIC) score compared to the single-term model. All the coefficients of single and interaction terms were significant. Fitting thin plate regression splines instead of interaction terms has significantly improved model fitting (&#x003C7;<sup>2</sup> = &#x0003C; 0.001) and resulted in 16 unit reduction of AIC compared to the interaction term model. Because non-linearity could lead to spurious variable interaction, we fitted models with smoothed predictor and tensor interaction (ti) terms that could delineate the interaction component from the main effect. Fitting product spline significantly improved model performance (&#x003C7;<sup>2</sup> = &#x0003C; 0.02) and induced around 3 unit reduction of AIC score compared to models with smoothed splines only. The model revealed the existence of significant non-linear interaction between NIHSS and CMB and between NIHSS and infarction size with effective degrees of freedom (EDF) of 4.6 and 3.1, respectively. <xref ref-type="fig" rid="F4">Figure 4</xref> shows the partial probability of HT incidence as a function of infarction size and conditioned on NIHSS score (scores of 2, 8, and 17) and CMB (3, 10, and 15). As illustrated in <xref ref-type="fig" rid="F4">Figure 4A</xref> the traditional LR model with single terms produced monotonous NIHSS curves that demonstrate similar parametric functions with infarction size at different CMB levels except for an absolute effect representing different intercepts. This pattern suggests a failure to detect any predictor interaction. In contrast, the model fitted with tensor products, <xref ref-type="fig" rid="F4">Figure 4B</xref>, demonstrated non-linear HT prediction probabilities as reflected by NIHSS curves across different levels of CMB and infarction size. For example, at a CMB level of 3, patients with a low NIHSS score of 2 started to respond at an infarction size of around 3.5 and exhibited sharp dependency on infarction size in contrast to patients with an NIHSS of 17, which started to respond at an infarction size of zero but with less dependency on infarction size. At CMB 15, the curve of the NIHSS score of 2 almost reached saturation, whereas the curve of the NIHSS score of 17 almost becomes linear with a low slope demonstrating less dependency on infarction size. The results presented in <xref ref-type="table" rid="T3">Table 3</xref> and <xref ref-type="fig" rid="F4">Figure 4</xref> lend support to the suggested predictor interaction presented in <xref ref-type="fig" rid="F3">Figure 3</xref>.</p>
<table-wrap position="float" id="T3">
<label>Table 3</label>
<caption><p>Fitted GAM models to examine predictors linearity and interaction.</p></caption>
<table frame="hsides" rules="groups">
<thead><tr>
<th valign="top" align="left"><bold>Model</bold></th>
<th valign="top" align="center"><bold>Coefficient (SE)</bold></th>
<th valign="top" align="center"><italic><bold>p</bold></italic><bold>-value</bold></th>
<th valign="top" align="center"><bold>EDF</bold></th>
<th valign="top" align="center"><bold>Adjusted <italic>R</italic><sup>2</sup></bold></th>
<th valign="top" align="center"><bold>Deviance</bold></th>
<th valign="top" align="center"><italic><bold>p</bold></italic><bold>&#x003C7;2</bold></th>
<th valign="top" align="center"><bold>AIC</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left" colspan="8"><bold>Single term model</bold></td>
</tr>
<tr>
<td valign="top" align="left">HT = &#x003B2;<sub>1</sub>CMB &#x0002B; &#x003B2;<sub>2</sub>NIHSS &#x0002B; &#x003B2;<sub>3</sub> Size</td>
<td valign="top" align="center">&#x003B2;<sub>1</sub>= 0.24 (0.03)</td>
<td valign="top" align="center">&#x0003C; 0.001</td>
<td/>
<td valign="top" align="center">0.33</td>
<td/>
<td valign="top" align="center">Reference</td>
<td valign="top" align="center">239.4</td>
</tr>
<tr>
<td/>
<td valign="top" align="center">&#x003B2;<sub>2</sub>= 0.12 (0.03)</td>
<td valign="top" align="center">&#x0003C; 0.001</td>
<td/>
<td/>
<td/>
<td/>
<td/>
</tr>
<tr>
<td/>
<td valign="top" align="center">&#x003B2;<sub>3</sub>= 0.48 (0.09)</td>
<td valign="top" align="center">&#x0003C; 0.001</td>
<td/>
<td/>
<td/>
<td/>
<td/>
</tr>
<tr>
<td valign="top" align="left" colspan="8"><bold>Interaction term model</bold></td>
</tr>
<tr>
<td valign="top" align="left">HT = CMB &#x0002B; NIHSS &#x0002B; Size &#x0002B; NIHSS&#x0002A;Size &#x0002B; CMB&#x0002A;NIHSS &#x0002B; CMB&#x0002A;Size &#x0002B; CMB&#x0002A;NIHSS&#x0002A;Size</td>
<td valign="top" align="center">&#x003B2;<sub>1</sub>= 0.98 (0.19)</td>
<td valign="top" align="center">&#x0003C; 0.001</td>
<td/>
<td valign="top" align="center">0.40</td>
<td valign="top" align="center">29.6<xref ref-type="table-fn" rid="TN8"><sup>&#x000B1;</sup></xref></td>
<td valign="top" align="center">&#x0003C; 0.001<xref ref-type="table-fn" rid="TN8"><sup>&#x000B1;</sup></xref></td>
<td valign="top" align="center">217.9</td>
</tr>
<tr>
<td/>
<td valign="top" align="center">&#x003B2;<sub>2</sub>= 0.47 (0.10)</td>
<td valign="top" align="center">&#x0003C; 0.001</td>
<td/>
<td/>
<td/>
<td/>
<td/>
</tr>
<tr>
<td/>
<td valign="top" align="center">&#x003B2;<sub>3</sub>= 1.84 (0.40)</td>
<td valign="top" align="center">&#x0003C; 0.001</td>
<td/>
<td/>
<td/>
<td/>
<td/>
</tr>
<tr>
<td/>
<td valign="top" align="center">&#x003B2;<sub>4</sub>= &#x02212;0.04 (0.01)</td>
<td valign="top" align="center">&#x0003C; 0.001</td>
<td/>
<td/>
<td/>
<td/>
<td/>
</tr>
<tr>
<td/>
<td valign="top" align="center">&#x003B2;<sub>5</sub>= &#x02212;0.17 (0.06)</td>
<td valign="top" align="center">&#x0003C; 0.01</td>
<td/>
<td/>
<td/>
<td/>
<td/>
</tr>
<tr>
<td/>
<td valign="top" align="center">&#x003B2;<sub>6</sub>= &#x02212;0.06 (0.02)</td>
<td valign="top" align="center">&#x0003C; 0.01</td>
<td/>
<td/>
<td/>
<td/>
<td/>
</tr>
<tr>
<td/>
<td valign="top" align="center">&#x003B2;<sub>7</sub> = 0.007 (0.003)</td>
<td valign="top" align="center">&#x0003C; 0.05</td>
<td/>
<td/>
<td/>
<td/>
<td/>
</tr>
<tr>
<td valign="top" align="left" colspan="8"><bold>Spline fitted model</bold></td>
</tr>
<tr>
<td valign="top" align="left">HT = &#x003B2;<sub>1</sub>CMB &#x0002B; &#x003B2;<sub>2</sub>NIHSS &#x0002B; &#x003B2;<sub>3</sub>Size&#x0002B;</td>
<td valign="top" align="center">&#x003B2;<sub>1</sub> = 0.47 (0.2)</td>
<td valign="top" align="center">&#x0003C; 0.05</td>
<td valign="top" align="center">1.6</td>
<td valign="top" align="center">0.50</td>
<td valign="top" align="center">56.8<xref ref-type="table-fn" rid="TN8"><sup>&#x000B1;</sup></xref></td>
<td valign="top" align="center">&#x0003C; 0.001<xref ref-type="table-fn" rid="TN8"><sup>&#x000B1;</sup></xref></td>
<td valign="top" align="center">201.5</td>
</tr>
<tr>
<td valign="top" align="left">s(CMB) &#x0002B; s(NIHSS) &#x0002B; s(Size)</td>
<td valign="top" align="center">&#x003B2;<sub>2</sub> = &#x02212;0.52 (0.14)</td>
<td valign="top" align="center">&#x0003C; 0.001</td>
<td valign="top" align="center">6.8</td>
<td/>
<td valign="top" align="center">27.3<xref ref-type="table-fn" rid="TN9"><sup>&#x000A7;</sup></xref></td>
<td valign="top" align="center">&#x0003C; 0.001<xref ref-type="table-fn" rid="TN9"><sup>&#x000A7;</sup></xref></td>
<td/>
</tr>
<tr>
<td/>
<td valign="top" align="center">&#x003B2;<sub>3</sub> = 1.14 (0.65)</td>
<td valign="top" align="center">0.08</td>
<td valign="top" align="center">2.0</td>
<td/>
<td/>
<td/>
<td/>
</tr>
<tr>
<td/>
<td valign="top" align="center">s(CMB)</td>
<td valign="top" align="center">&#x0003C; 0.05</td>
<td/>
<td/>
<td/>
<td/>
<td/>
</tr>
<tr>
<td/>
<td valign="top" align="center">s(NIHSS)</td>
<td valign="top" align="center">&#x0003C; 0.001</td>
<td/>
<td/>
<td/>
<td/>
<td/>
</tr>
<tr>
<td/>
<td valign="top" align="center">s(Size)</td>
<td valign="top" align="center">0.09</td>
<td/>
<td/>
<td/>
<td/>
<td/>
</tr>
<tr>
<td valign="top" align="left" colspan="8"><bold>Tensor product model</bold></td>
</tr>
<tr>
<td valign="top" align="left">HT= s(CMB)&#x0002B; s(NIHSS)&#x0002B; s(Size)&#x0002B;</td>
<td valign="top" align="center">s(CMB)</td>
<td valign="top" align="center">&#x0003C; 0.001</td>
<td valign="top" align="center">2.6</td>
<td valign="top" align="center">0.52</td>
<td valign="top" align="center">66.6<xref ref-type="table-fn" rid="TN8"><sup>&#x000B1;</sup></xref></td>
<td valign="top" align="center">&#x0003C; 0.001<xref ref-type="table-fn" rid="TN8"><sup>&#x000B1;</sup></xref></td>
<td valign="top" align="center">197.9</td>
</tr>
<tr>
<td valign="top" align="left">ti(CMB, NIHSS)&#x0002B; ti(CMB, Size)&#x0002B;</td>
<td valign="top" align="center">s(NIHSS)</td>
<td valign="top" align="center">&#x0003C; 0.001</td>
<td valign="top" align="center">1.0</td>
<td/>
<td valign="top" align="center">37.0<xref ref-type="table-fn" rid="TN9"><sup>&#x000A7;</sup></xref></td>
<td valign="top" align="center">&#x0003C; 0.001<xref ref-type="table-fn" rid="TN9"><sup>&#x000A7;</sup></xref></td>
<td/>
</tr>
<tr>
<td valign="top" align="left">ti(NIHSS, Size)</td>
<td valign="top" align="center">s(Size)</td>
<td valign="top" align="center">&#x0003C;0.001</td>
<td valign="top" align="center">1.0</td>
<td/>
<td valign="top" align="center">9.7<xref ref-type="table-fn" rid="TN10"><sup>&#x000A5;</sup></xref></td>
<td valign="top" align="center">&#x0003C;0.05<xref ref-type="table-fn" rid="TN10"><sup>&#x000A5;</sup></xref></td>
<td/>
</tr>
<tr>
<td/>
<td valign="top" align="center">ti(CMB,NIHSS)</td>
<td valign="top" align="center">&#x0003C;0.01</td>
<td valign="top" align="center">4.6</td>
<td/>
<td/>
<td/>
<td/>
</tr>
<tr>
<td/>
<td valign="top" align="center">ti(CMB,Size)</td>
<td valign="top" align="center">0.19</td>
<td valign="top" align="center">1.0</td>
<td/>
<td/>
<td/>
<td/>
</tr>
<tr>
<td/>
<td valign="top" align="center">ti(NIHSS,Size)</td>
<td valign="top" align="center">&#x0003C; 0.001</td>
<td valign="top" align="center">3.1</td>
<td/>
<td/>
<td/>
<td/>
</tr>
</tbody>
</table>
<table-wrap-foot>
<fn id="TN8"><label>&#x000B1;</label><p>Compared to single-term model.</p></fn>
<fn id="TN9"><label>&#x000A7;</label><p>Compared to interaction term model.</p></fn>
<fn id="TN10"><label>&#x000A5;</label><p>Compared to tensor product model.</p></fn>
<p>Size in the model refers to infarction size. Effective degrees of freedom (EDF) &#x0003E; 1 indicates non-linearity. In the final model, the smooth CMB, the product tensor of CMB and NIHSS, and the product tensor of NIHSS and infarction size were significantly no-nlinear. Intercept was not included in the table.</p>
</table-wrap-foot>
</table-wrap>
<fig id="F4" position="float">
<label>Figure 4</label>
<caption><p>Partial probability of HT incidence as a function of infarction size and conditioned on NIHSS score and CMB count. <bold>(A,B)</bold> show logistic regression fitted with single terms and a generalized additive model fitted with thin plate regression splines with tensor product terms, respectively. Tensor product terms could delineate the interaction component from the main effect. <bold>(A)</bold> shows monotonous NIHSS curves that demonstrate similar parametric functions with infarction size at different CMB levels except for an absolute effect representing different intercepts. This logistic regression pattern suggests a failure to detect predictor interaction and hence predictive power. In contrast, <bold>(B)</bold> demonstrates non-linear HT predicting functions as reflected by NIHSS curves across different levels of CMB and infarction size. For example, at a CMB level of 3, patients with a low NIHSS score of 2 started to respond at an infarction size of around 3.5 and exhibited sharp dependency on infarction size in contrast to patients with an NIHSS of 17, which started to respond at an infarction size of zero but with less dependency on infarction size. At CMB 15, the curve of the NIHSS score of 2 almost reached saturation, whereas the curve of the NIHSS score of 17 almost becomes linear with a low slope demonstrating less dependency on infarction size.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fneur-13-951401-g0004.tif"/>
</fig></sec></sec>
<sec sec-type="discussion" id="s4">
<title>Discussion</title>
<p>The objective of the current prospective study was to determine the incidence and predictors of HT, investigate the interaction between predictors, and identify the optimal predicting models. HT incidence in our study was 19.8%, which was in agreement with other studies (<xref ref-type="bibr" rid="B35">35</xref>, <xref ref-type="bibr" rid="B36">36</xref>). To produce a parsimonious model with enhanced interpretability, improved accuracy, and enhanced generalizability, we utilized an mRMR algorithm capable of measuring the amount of information between variables (<xref ref-type="bibr" rid="B37">37</xref>). It was possible to identify three risk factors namely NIHSS, CMB, and infarction size as potential HT predictors in the training dataset. Detecting the small number of risk factors associated with HT is not unusual. Terruso et al. (<xref ref-type="bibr" rid="B38">38</xref>) reported that infarction size was the only significant risk factor associated with HT in their study. Tan et al. (<xref ref-type="bibr" rid="B35">35</xref>) found that only infarction size and atrial fibrillation were significantly associated with HT. We could not find sufficient mutual information between HT and age, sex, DM, hypertension, dyslipidemia, platelet count, previous use of antiplatelet, and previous stroke attacks. This finding echoed the results from previous studies (<xref ref-type="bibr" rid="B15">15</xref>, <xref ref-type="bibr" rid="B36">36</xref>, <xref ref-type="bibr" rid="B38">38</xref>, <xref ref-type="bibr" rid="B39">39</xref>).</p>
<p>Random forest classifier, GBC, SVC, LRC, and MLPC models were developed using the three identified predictors. The validity of NIHSS, CMB, and infarction size to predict HT incidence was verified by comparison with models developed with all the studied variables. The predictive performance of RFC and GBC models significantly outperformed LRC and MLPC as assessed by the AUC difference. The superior predictive performance of GBC models over classical regression was previously documented in a large study conducted to predict severe complications after acute ischemic stroke (<xref ref-type="bibr" rid="B8">8</xref>). In general, the performance metrics of the best ML models in our study were either better or comparable to previous models developed to predict HT incidence in previous studies taking into consideration differences in the utilized HT definition, study design, ethnicity, and sample size of the study group (<xref ref-type="bibr" rid="B40">40</xref>, <xref ref-type="bibr" rid="B41">41</xref>).</p>
<p>Infarction size and NIHSS were the most frequently reported risk factors associated with HT either in retrospective or prospective studies (<xref ref-type="bibr" rid="B15">15</xref>, <xref ref-type="bibr" rid="B38">38</xref>, <xref ref-type="bibr" rid="B42">42</xref>). Either or both risk factors were commonly included in the previous HT prediction scales in addition to other variables (<xref ref-type="bibr" rid="B40">40</xref>&#x02013;<xref ref-type="bibr" rid="B42">42</xref>). CMB was shown by some recent prospective studies to predict the incidence of future hemorrhagic transformation and intracerebral hemorrhage (<xref ref-type="bibr" rid="B43">43</xref>, <xref ref-type="bibr" rid="B44">44</xref>). Investigating the optimized model predicted spaces within the true feature/variable space suggested the interaction between predictors as reflected by the non-linear surface as a function of more than one variable. The interaction between predictors was investigated using generalized regression models fitted with smooth splines and tensor products. The results demonstrated the existence of non-linear interaction between predictors and resulted in significant improvement in explaining HT variance compared with models with no interaction terms. This result suggests that there is no fixed cutoff value for any of these predictors after which the risk of hemorrhagic transformation would increase. As shown in <xref ref-type="fig" rid="F4">Figure 4B</xref>, the significance of any of these risk factors to hemorrhagic transformation is changing depending on the values of other risk factors. For example, at a particular value of NIHSS, the value of infarction size needed to increase the risk of HT is reduced if the number of CMB increases. Therefore, the best HT predicting models were those capable of capturing the non-linear relations by algorithms such as RFC and GBC compared to classical LRC.</p>
<p>The interaction implies that NIHSS, CMB, and infarction size could be integrating both similar and distinct upstream pathophysiological mechanisms that collectively could enhance HT incidence. For example, CMB was reported to be significantly associated with advanced age and cerebral small vessel impairment/diseases that occur in hypertension, cerebral amyloid angiopathy, and chronic kidney disease (<xref ref-type="bibr" rid="B45">45</xref>&#x02013;<xref ref-type="bibr" rid="B48">48</xref>). The clinical syndromes that have been associated with CMB, such as cognitive impairment, dementia, and recurrent ischemic stroke, further reflect the underlying functional impairment of cerebral small vessels (<xref ref-type="bibr" rid="B49">49</xref>). Severe NIHSS score, on the other hand, could integrate risk factors such as old age, female gender, small vessel diseases, and mental and psychological stress of social isolation, and cardiac diseases such as AF, low ejection fraction, heart failure, and cardioembolism. All those risk factors were reported to be significantly associated with NIHSS (<xref ref-type="bibr" rid="B48">48</xref>, <xref ref-type="bibr" rid="B50">50</xref>&#x02013;<xref ref-type="bibr" rid="B52">52</xref>).</p>
<p>Several mechanisms could be envisioned to explain the observed interaction between CMB, infarction size, and NIHSS score. One plausible mechanism is <italic>via</italic> cerebral small vessel disease, which could increase both the CMB burden and NIHSS score (<xref ref-type="bibr" rid="B48">48</xref>). Cerebral small vessel disease was also reported to be significantly associated with poor collateral recruitment, which could be the underlying mechanism of the observed interaction between CMB, NIHSS, and infarction size. Reduced perfusion from poor collaterals is a determining factor in controlling infarction size, stroke risk, and poor functional outcome (<xref ref-type="bibr" rid="B53">53</xref>). Furthermore, cerebral small vessel disease was also reported to be significantly associated with impaired cerebral autoregulation. A growing body of evidence links impaired cerebral autoregulation with the incidence of HT and acute ischemic stroke (<xref ref-type="bibr" rid="B54">54</xref>, <xref ref-type="bibr" rid="B55">55</xref>). Genetic susceptibility and APOE &#x003B5;4 genotype have been reported to be significantly associated with CMB and also with enhanced susceptibility to cerebrovascular insults and, thus, large infarction size (<xref ref-type="bibr" rid="B45">45</xref>, <xref ref-type="bibr" rid="B56">56</xref>).</p>
<p>The strength of our study stems from utilizing a prospective study design, using a filter-based, rather than wrapper-based, variable selection technique independent of any model fitting process, and utilizing ML models capable of detecting complex non-linear relationships in a totally na&#x000EF;ve testing dataset. In wrapper-based variable selection methods, variables are selected iteratively based on the performance of the fitted model, which could result in models that produce the best performance only with the utilized dataset.</p>
<p>The limitations of our study include the involvement of only one academic hospital and one race. Also, our sample size was relatively small, which could limit the power of our study to detect other risk factors associated with HT incidence. Another source of limitation was the exclusion of patients who received rtPA therapy. Unfortunately, the majority of patients fail to present to our emergency room within the rtPA therapeutic window (<xref ref-type="bibr" rid="B57">57</xref>). We thought that the inclusion of those cases could bias results. The death of six subjects before taking the second MRI and hence determination of HT incidence could be a potential source of bias especially if they represented severe cases. In addition, we did not include ASPECTS and collateral scores in our analysis because they were performed for patients on rtPA therapy and not routinely for any patient. The observed moderate PPV suggests the need to include further biological and imaging markers to limit false positive cases. Our results need to be replicated in patients on rtPA therapy to examine the difference in risk profiles and predicting models.</p></sec>
<sec sec-type="conclusions" id="s5">
<title>Conclusion</title>
<p>National Institute of Health stroke scale, infarction size, and CMB were the best HT predictors. The observed interaction between predictors suggests a dynamic, rather than fixed, cutoff risk value for any of these predictors. The best HT predicting models were demonstrated by algorithms capable of capturing non-linear relations such as RFC and GBC compared to the classical LRC. Prediction of HT susceptibility could facilitate designing management plans for high-risk patients.</p></sec>
<sec sec-type="data-availability" id="s6">
<title>Data availability statement</title>
<p>The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.</p></sec>
<sec id="s7">
<title>Ethics statement</title>
<p>The studies involving human participants were reviewed and approved by Local Ethics Committee of Neurology Department, Zagazig University Hospitals &#x00023; 332/2018. The patients/participants or their legal guardians provided their written informed consent to participate in this study.</p></sec>
<sec id="s8">
<title>Author contributions</title>
<p>AE, RF, NS, and BR: conceptualization and data curation, methodology, and first draft. AE: formal analysis and final version. All authors have read and approved the final manuscript.</p></sec>
<sec sec-type="COI-statement" id="conf1">
<title>Conflict of interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p></sec>
<sec sec-type="disclaimer" id="s9">
<title>Publisher&#x00027;s note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p></sec></body>
<back>
<sec sec-type="supplementary-material" id="s10">
<title>Supplementary material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="https://www.frontiersin.org/articles/10.3389/fneur.2022.951401/full#supplementary-material">https://www.frontiersin.org/articles/10.3389/fneur.2022.951401/full#supplementary-material</ext-link></p>
<supplementary-material xlink:href="Image_1.tif" id="SM1" mimetype="image/tif" xmlns:xlink="http://www.w3.org/1999/xlink">
<label>Supplementary Figure 1</label>
<caption><p>Learning curves present mean accuracy scores (&#x000B1;1 SD) as a function of the sample size used for cross-validation. The learning curve of GBC exhibited an overfitting pattern to the training dataset. However, the performance of GBC in the testing dataset was satisfactory. For RFC and SVC, the learning curves were not significantly lower in the testing dataset compared to the training dataset. For sample sizes &#x0003E;80, RFC and SVC did not exhibit major signs of bias or overfitting as suggested by the difference between the training and testing datasets. The learning curves of LRC and MLPC seem to get closer to accuracies lower than RFC, GBC, and SVC. Also, LRC and MLPC curves exhibited higher variability as suggested by the apparently larger &#x000B1; SD intervals and crossing of training and testing curves, which may explain their low AUC metrics.</p></caption></supplementary-material>
<supplementary-material xlink:href="Image_2.tif" id="SM2" mimetype="image/tif" xmlns:xlink="http://www.w3.org/1999/xlink">
<label>Supplementary Figure 2</label>
<caption><p>Area under the ROC curve (AUC) comparison of the three-variable and 16-variable models. Except for the three-variable SVC, which significantly surpassed that of the 16-variable model, the AUC of the 16-variable RFC, GBC, LRC, and MLPC was not significantly different from those of the three-variable models.</p></caption></supplementary-material>
<supplementary-material xlink:href="Table_1.docx" id="SM3" mimetype="application/vnd.openxmlformats-officedocument.wordprocessingml.document" xmlns:xlink="http://www.w3.org/1999/xlink">
<label>Supplementary Table 1</label>
<caption><p>Tuned parameters of five models.</p></caption></supplementary-material>
<supplementary-material xlink:href="Table_2.docx" id="SM4" mimetype="application/vnd.openxmlformats-officedocument.wordprocessingml.document" xmlns:xlink="http://www.w3.org/1999/xlink">
<label>Supplementary Table 2</label>
<caption><p>Univariate analysis of risk factors associated with HT incidence in the training datasets.</p></caption></supplementary-material></sec>
<ref-list>
<title>References</title>
<ref id="B1">
<label>1.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lindley</surname> <given-names>RI</given-names></name> <name><surname>Wardlaw</surname> <given-names>JM</given-names></name> <name><surname>Sandercock</surname> <given-names>PA</given-names></name> <name><surname>Rimdusid</surname> <given-names>P</given-names></name> <name><surname>Lewis</surname> <given-names>SC</given-names></name> <name><surname>Signorini</surname> <given-names>DF</given-names></name> <etal/></person-group>. <article-title>Frequency and risk factors for spontaneous hemorrhagic transformation of cerebral infarction</article-title>. <source>J Stroke Cerebrovasc Dis.</source> (<year>2004</year>) <volume>13</volume>:<fpage>235</fpage>&#x02013;<lpage>46</lpage>. <pub-id pub-id-type="doi">10.1016/j.jstrokecerebrovasdis.2004.03.003</pub-id><pub-id pub-id-type="pmid">17903981</pub-id></citation></ref>
<ref id="B2">
<label>2.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jaillard</surname> <given-names>A</given-names></name> <name><surname>Cornu</surname> <given-names>C</given-names></name> <name><surname>Durieux</surname> <given-names>A</given-names></name> <name><surname>Moulin</surname> <given-names>T</given-names></name> <name><surname>Boutitie</surname> <given-names>F</given-names></name> <name><surname>Lees</surname> <given-names>KR</given-names></name> <name><surname>Hommel</surname> <given-names>M</given-names></name></person-group>. <article-title>Hemorrhagic transformation in acute ischemic stroke. The MAST-E study. MAST-E Group</article-title>. <source>Stroke</source>. (<year>1999</year>) <volume>30</volume>:<fpage>1326</fpage>&#x02013;<lpage>32</lpage>. <pub-id pub-id-type="doi">10.1161/01.STR.30.7.1326</pub-id><pub-id pub-id-type="pmid">10390303</pub-id></citation></ref>
<ref id="B3">
<label>3.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lei</surname> <given-names>C</given-names></name> <name><surname>Wu</surname> <given-names>B</given-names></name> <name><surname>Liu</surname> <given-names>M</given-names></name> <name><surname>Chen</surname> <given-names>Y</given-names></name></person-group>. <article-title>Asymptomatic hemorrhagic transformation after acute ischemic stroke: is it clinically innocuous?</article-title> <source>J Stroke Cerebrovasc Dis</source>. (<year>2014</year>) <volume>23</volume>:<fpage>2767</fpage>&#x02013;<lpage>72</lpage>. <pub-id pub-id-type="doi">10.1016/j.jstrokecerebrovasdis.2014.06.024</pub-id><pub-id pub-id-type="pmid">25314946</pub-id></citation></ref>
<ref id="B4">
<label>4.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Andrade</surname> <given-names>JBC</given-names></name> <name><surname>Mohr</surname> <given-names>JP</given-names></name> <name><surname>Lima</surname> <given-names>FO</given-names></name> <name><surname>de Carvalho</surname> <given-names>JJF</given-names></name> <name><surname>Barros</surname> <given-names>LCM</given-names></name> <name><surname>Nepomuceno</surname> <given-names>CR</given-names></name> <name><surname>Ferrer</surname> <given-names>JVCC</given-names></name> <name><surname>Silva</surname> <given-names>GS</given-names></name></person-group>. <article-title>The role of hemorrhagic transformation in acute ischemic stroke upon clinical complications and outcomes</article-title>. <source>J Stroke Cerebrovasc Dis</source>. (<year>2020</year>) <volume>29</volume>:<fpage>104898</fpage>. <pub-id pub-id-type="doi">10.1016/j.jstrokecerebrovasdis.2020.104898</pub-id><pub-id pub-id-type="pmid">32417239</pub-id></citation></ref>
<ref id="B5">
<label>5.</label>
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Lengerich</surname> <given-names>B</given-names></name> <name><surname>Tan</surname> <given-names>S</given-names></name> <name><surname>Chang</surname> <given-names>CH</given-names></name> <name><surname>Hooker</surname> <given-names>G</given-names></name> <name><surname>Caruana</surname> <given-names>R</given-names></name></person-group>. <article-title>Purifying interaction effects with the functional anova: An efficient algorithm for recovering identifiable additive models</article-title>. In: <source>International Conference on Artificial Intelligence and Statistics.</source> <publisher-loc>PMLR</publisher-loc> (<year>2020</year>). p. <fpage>2402</fpage>&#x02013;<lpage>12</lpage>.</citation>
</ref>
<ref id="B6">
<label>6.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gennings</surname> <given-names>C</given-names></name> <name><surname>Carter</surname> <given-names>WH</given-names> <suffix>Jr</suffix></name> <name><surname>Carchman</surname> <given-names>RA</given-names></name> <name><surname>Teuschler</surname> <given-names>LK</given-names></name> <name><surname>Simmons</surname> <given-names>JE</given-names></name> <name><surname>Carney</surname> <given-names>EW</given-names></name> <etal/></person-group>. <article-title>unifying concept for assessing toxicological interactions: changes in slope</article-title>. <source>Toxicol Sci.</source> (<year>2005</year>) <volume>88</volume>:<fpage>287</fpage>&#x02013;<lpage>97</lpage>. <pub-id pub-id-type="doi">10.1093/toxsci/kfi275</pub-id><pub-id pub-id-type="pmid">16081521</pub-id></citation></ref>
<ref id="B7">
<label>7.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mainali</surname> <given-names>S</given-names></name> <name><surname>Darsie</surname> <given-names>ME</given-names></name> <name><surname>Smetana</surname> <given-names>KS</given-names></name></person-group>. <article-title>Machine learning in action: Stroke diagnosis and outcome prediction</article-title>. <source>Front Neurol</source>. (<year>2021</year>) <volume>12</volume>:<fpage>734345</fpage>. <pub-id pub-id-type="doi">10.3389/fneur.2021.734345</pub-id><pub-id pub-id-type="pmid">35937051</pub-id></citation></ref>
<ref id="B8">
<label>8.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bonkhoff</surname> <given-names>AK</given-names></name> <name><surname>R&#x000FC;bsamen</surname> <given-names>N</given-names></name> <name><surname>Grefkes</surname> <given-names>C</given-names></name> <name><surname>Rost</surname> <given-names>NS</given-names></name> <name><surname>Berger</surname> <given-names>K</given-names></name> <name><surname>Karch</surname> <given-names>A</given-names></name></person-group>. <article-title>Development and validation of prediction models for severe complications after acute ischemic stroke: a study based on the stroke Registry of Northwestern Germany</article-title>. <source>J Am Heart Assoc.</source> (<year>2022</year>) <volume>11</volume>:<fpage>e023175</fpage>. <pub-id pub-id-type="doi">10.1161/JAHA.121.023175</pub-id><pub-id pub-id-type="pmid">35253466</pub-id></citation></ref>
<ref id="B9">
<label>9.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Qutrio Baloch</surname> <given-names>Z</given-names></name> <name><surname>Raza</surname> <given-names>SA</given-names></name> <name><surname>Pathak</surname> <given-names>R</given-names></name> <name><surname>Marone</surname> <given-names>L</given-names></name> <name><surname>Ali</surname> <given-names>A</given-names></name></person-group>. <article-title>Machine learning confirms nonlinear relationship between severity of peripheral arterial disease, functional limitation and symptom severity</article-title>. <source>Diagnostics.</source> (<year>2020</year>) <volume>10</volume>:<fpage>515</fpage>. <pub-id pub-id-type="doi">10.3390/diagnostics10080515</pub-id><pub-id pub-id-type="pmid">32722280</pub-id></citation></ref>
<ref id="B10">
<label>10.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Singal</surname> <given-names>AG</given-names></name> <name><surname>Mukherjee</surname> <given-names>A</given-names></name> <name><surname>Elmunzer</surname> <given-names>BJ</given-names></name> <name><surname>Higgins</surname> <given-names>PD</given-names></name> <name><surname>Lok</surname> <given-names>AS</given-names></name> <name><surname>Zhu</surname> <given-names>J</given-names></name> <etal/></person-group>. <article-title>Machine learning algorithms outperform conventional regression models in predicting development of hepatocellular carcinoma</article-title>. <source>Am J Gastroenterol.</source> (<year>2013</year>) <volume>108</volume>:<fpage>1723</fpage>. <pub-id pub-id-type="doi">10.1038/ajg.2013.332</pub-id><pub-id pub-id-type="pmid">24169273</pub-id></citation></ref>
<ref id="B11">
<label>11.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tong</surname> <given-names>DC</given-names></name> <name><surname>Adami</surname> <given-names>A</given-names></name> <name><surname>Moseley</surname> <given-names>ME</given-names></name> <name><surname>Marks</surname> <given-names>MP</given-names></name></person-group>. <article-title>Relationship between apparent diffusion coefficient and subsequent hemorrhagic transformation following acute ischemic stroke</article-title>. <source>Stroke.</source> (<year>2000</year>) <volume>31</volume>:<fpage>2378</fpage>&#x02013;<lpage>84</lpage>. <pub-id pub-id-type="doi">10.1161/01.STR.31.10.2378</pub-id><pub-id pub-id-type="pmid">11022067</pub-id></citation></ref>
<ref id="B12">
<label>12.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yates</surname> <given-names>PA</given-names></name> <name><surname>Villemagne</surname> <given-names>VL</given-names></name> <name><surname>Ellis</surname> <given-names>KA</given-names></name> <name><surname>Desmond</surname> <given-names>PM</given-names></name> <name><surname>Masters</surname> <given-names>CL</given-names></name> <name><surname>Rowe</surname> <given-names>CC</given-names></name></person-group>. <article-title>Cerebral microbleeds: a review of clinical, genetic, and neuroimaging associations</article-title>. <source>Front Neurol.</source> (<year>2014</year>) <volume>4</volume>:<fpage>205</fpage>. <pub-id pub-id-type="doi">10.3389/fneur.2013.00205</pub-id><pub-id pub-id-type="pmid">24432010</pub-id></citation></ref>
<ref id="B13">
<label>13.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cordonnier</surname> <given-names>C</given-names></name> <name><surname>Potter</surname> <given-names>GM</given-names></name> <name><surname>Jackson</surname> <given-names>CA</given-names></name> <name><surname>Doubal</surname> <given-names>F</given-names></name> <name><surname>Keir</surname> <given-names>S</given-names></name> <name><surname>Sudlow</surname> <given-names>CLM</given-names></name> <etal/></person-group>. <article-title>Improving interrater agreement about brain microbleeds: development of the Brain Observer MicroBleed Scale (BOMBS)</article-title>. <source>Stroke.</source> (<year>2009</year>) <volume>40</volume>:<fpage>94</fpage>&#x02013;<lpage>9</lpage>. <pub-id pub-id-type="doi">10.1161/STROKEAHA.108.526996</pub-id><pub-id pub-id-type="pmid">19008468</pub-id></citation></ref>
<ref id="B14">
<label>14.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Charidimou</surname> <given-names>A</given-names></name> <name><surname>J&#x000E4;ger</surname> <given-names>RH</given-names></name> <name><surname>Fox</surname> <given-names>Z</given-names></name> <name><surname>Peeters</surname> <given-names>A</given-names></name> <name><surname>Vandermeeren</surname> <given-names>Y</given-names></name> <name><surname>Laloux</surname> <given-names>P</given-names></name> <etal/></person-group>. <article-title>Prevalence and mechanisms of cortical superficial siderosis in cerebral amyloid angiopathy</article-title>. <source>Neurology.</source> (<year>2013</year>) <volume>81</volume>:<fpage>626</fpage>&#x02013;<lpage>32</lpage>. <pub-id pub-id-type="doi">10.1212/WNL.0b013e3182a08f2c</pub-id><pub-id pub-id-type="pmid">23864315</pub-id></citation></ref>
<ref id="B15">
<label>15.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pan</surname> <given-names>SL</given-names></name> <name><surname>Wu</surname> <given-names>SC</given-names></name> <name><surname>Wu</surname> <given-names>TH</given-names></name> <name><surname>Lee</surname> <given-names>TK</given-names></name> <name><surname>Chen</surname> <given-names>TH</given-names></name></person-group>. <article-title>Location and size of infarct on functional outcome of noncardioembolic ischemic stroke</article-title>. <source>Disabil Rehabil</source>. (<year>2006</year>) <volume>28</volume>:<fpage>977</fpage>&#x02013;<lpage>83</lpage>. <pub-id pub-id-type="doi">10.1080/09638280500404438</pub-id><pub-id pub-id-type="pmid">16882637</pub-id></citation></ref>
<ref id="B16">
<label>16.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bamford</surname> <given-names>J</given-names></name> <name><surname>Sandercock</surname> <given-names>P</given-names></name> <name><surname>Dennis</surname> <given-names>M</given-names></name> <name><surname>Warlow</surname> <given-names>C</given-names></name> <name><surname>Burn</surname> <given-names>JJ</given-names></name></person-group>. <article-title>Classification and natural history of clinically identifiable subtypes of cerebral infarction</article-title>. <source>Lancet.</source> (<year>1991</year>) <volume>337</volume>:<fpage>1521</fpage>&#x02013;<lpage>6</lpage>. <pub-id pub-id-type="doi">10.1016/0140-6736(91)93206-O</pub-id><pub-id pub-id-type="pmid">1675378</pub-id></citation></ref>
<ref id="B17">
<label>17.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chung</surname> <given-names>JW</given-names></name> <name><surname>Park</surname> <given-names>SH</given-names></name> <name><surname>Kim</surname> <given-names>N</given-names></name> <name><surname>Kim</surname> <given-names>WJ</given-names></name> <name><surname>Park</surname> <given-names>JH</given-names></name> <name><surname>Ko</surname> <given-names>Y</given-names></name> <etal/></person-group>. <article-title>Trial of ORG 10172 in Acute Stroke Treatment (TOAST) classification and vascular territory of ischemic stroke lesions diagnosed by diffusion-weighted imaging</article-title>. <source>J Am Heart Assoc.</source> (<year>2014</year>) <volume>3</volume>:<fpage>e001119</fpage>. <pub-id pub-id-type="doi">10.1161/JAHA.114.001119</pub-id><pub-id pub-id-type="pmid">25112556</pub-id></citation></ref>
<ref id="B18">
<label>18.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Paciaroni</surname> <given-names>M</given-names></name> <name><surname>Agnelli</surname> <given-names>G</given-names></name> <name><surname>Corea</surname> <given-names>F</given-names></name> <name><surname>Ageno</surname> <given-names>W</given-names></name> <name><surname>Alberti</surname> <given-names>A</given-names></name> <name><surname>Lanari</surname> <given-names>A</given-names></name> <etal/></person-group>. <article-title>Early hemorrhagic transformation of brain infarction: rate, predictive factors, and influence on clinical outcome: results of a prospective multicenter study</article-title>. <source>Stroke.</source> (<year>2008</year>) <volume>39</volume>:<fpage>2249</fpage>&#x02013;<lpage>56</lpage>. <pub-id pub-id-type="doi">10.1161/STROKEAHA.107.510321</pub-id><pub-id pub-id-type="pmid">18535273</pub-id></citation></ref>
<ref id="B19">
<label>19.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Larrue</surname> <given-names>V</given-names></name> <name><surname>von Kummer</surname> <given-names>R</given-names></name> <name><surname>M&#x000FC;ller</surname> <given-names>A</given-names></name> <name><surname>Bluhmki</surname> <given-names>E</given-names></name></person-group>. <article-title>Risk factors for severe hemorrhagic transformation in ischemic stroke patients treated with recombinant tissue plasminogen activator: a secondary analysis of the European-Australasian Acute Stroke Study (ECASS II)</article-title>. <source>Stroke.</source> (<year>2001</year>) <volume>32</volume>:<fpage>438</fpage>&#x02013;<lpage>41</lpage>. <pub-id pub-id-type="doi">10.1161/01.STR.32.2.438</pub-id><pub-id pub-id-type="pmid">11157179</pub-id></citation></ref>
<ref id="B20">
<label>20.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dash</surname> <given-names>M</given-names></name> <name><surname>Liu</surname> <given-names>H</given-names></name></person-group>. <article-title>Consistency-based search in feature selection</article-title>. <source>Artif Intell.</source> (<year>2003</year>) <volume>151</volume>:<fpage>155</fpage>&#x02013;<lpage>76</lpage>. <pub-id pub-id-type="doi">10.1016/S0004-3702(03)00079-1</pub-id><pub-id pub-id-type="pmid">32133955</pub-id></citation></ref>
<ref id="B21">
<label>21.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kumar</surname> <given-names>G</given-names></name> <name><surname>Kumar</surname> <given-names>K</given-names></name></person-group>. <article-title>An information theoretic approach for feature selection</article-title>. <source>Sec Commun Netw.</source> (<year>2012</year>) <volume>5</volume>:<fpage>178</fpage>&#x02013;<lpage>85</lpage>. <pub-id pub-id-type="doi">10.1002/sec.303</pub-id><pub-id pub-id-type="pmid">35316469</pub-id></citation></ref>
<ref id="B22">
<label>22.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kratzer</surname> <given-names>G</given-names></name> <name><surname>Furrer</surname> <given-names>R</given-names></name></person-group>. <article-title>Varrank: An R package for variable ranking based on mutual information with applications to observed systemic datasets</article-title>. <source>arXiv preprint arXiv:1804.07134</source>. (<year>2018</year>). <pub-id pub-id-type="doi">10.48550/arXiv.1804.07134</pub-id></citation>
</ref>
<ref id="B23">
<label>23.</label>
<citation citation-type="web"><person-group person-group-type="author"><collab>R Core Team</collab></person-group>. <source>R: A Language Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna</source> (<year>2021</year>). Available online at: <ext-link ext-link-type="uri" xlink:href="https://www.R-project.org/">https://www.R-project.org/</ext-link></citation>
</ref>
<ref id="B24">
<label>24.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tourassi</surname> <given-names>GD</given-names></name> <name><surname>Frederick</surname> <given-names>ED</given-names></name> <name><surname>Markey</surname> <given-names>MK</given-names></name> <name><surname>Floyd</surname> <given-names>CE</given-names> <suffix>Jr</suffix></name></person-group>. <article-title>Application of the mutual information criterion for feature selection in computer-aided diagnosis</article-title>. <source>Med Phys.</source> (<year>2001</year>) <volume>28</volume>:<fpage>2394</fpage>&#x02013;<lpage>402</lpage>. <pub-id pub-id-type="doi">10.1118/1.1418724</pub-id><pub-id pub-id-type="pmid">11797941</pub-id></citation></ref>
<ref id="B25">
<label>25.</label>
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Jung</surname> <given-names>A</given-names></name></person-group>. <source>Machine Learning: The Basics</source>. <publisher-loc>Singapore</publisher-loc>: <publisher-name>Springer</publisher-name> (<year>2022</year>). <pub-id pub-id-type="doi">10.1007/978-981-16-8193-6</pub-id></citation>
</ref>
<ref id="B26">
<label>26.</label>
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Bergstra</surname> <given-names>J</given-names></name> <name><surname>Yamins</surname> <given-names>D</given-names></name> <name><surname>Cox</surname> <given-names>D</given-names></name></person-group>. <article-title>Making a science of model search: Hyperparameter optimization in hundreds of dimensions for vision architectures</article-title>. In: <source>International Conference on Machine Learning.</source> <publisher-loc>PMLR</publisher-loc> (<year>2013</year>). p. <fpage>115</fpage>&#x02013;<lpage>23</lpage>.</citation>
</ref>
<ref id="B27">
<label>27.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bergstra</surname> <given-names>J</given-names></name> <name><surname>Bardenet</surname> <given-names>R</given-names></name> <name><surname>Bengio</surname> <given-names>Y</given-names></name> <name><surname>K&#x000E9;gl</surname> <given-names>B</given-names></name></person-group>. <article-title>Algorithms for hyper-parameter optimization</article-title>. In:<person-group person-group-type="editor"><name><surname>Shawe-Taylor</surname> <given-names>J</given-names></name> <name><surname>Zemel</surname> <given-names>RS</given-names></name> <name><surname>Bartlett</surname> <given-names>PL</given-names></name> <name><surname>Pereira</surname> <given-names>FCN</given-names></name> <name><surname>Weinberger</surname> <given-names>KQ</given-names></name></person-group>, editors. <source>NIPS</source>. (<year>2011</year>). p. <fpage>2546</fpage>&#x02013;<lpage>54</lpage>.</citation>
</ref>
<ref id="B28">
<label>28.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bradley</surname> <given-names>AP</given-names></name></person-group>. <article-title>The use of the area under the ROC curve in the evaluation of machine learning algorithms</article-title>. <source>Pattern Recognit.</source> (<year>1997</year>) <volume>30</volume>:<fpage>1145</fpage>&#x02013;<lpage>59</lpage>. <pub-id pub-id-type="doi">10.1016/S0031-3203(96)00142-2</pub-id></citation>
</ref>
<ref id="B29">
<label>29.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>DeLong</surname> <given-names>ER</given-names></name> <name><surname>DeLong</surname> <given-names>DM</given-names></name> <name><surname>Clarke-Pearson</surname> <given-names>DL</given-names></name></person-group>. <article-title>Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach</article-title>. <source>Biometrics</source>. (<year>1988</year>) <volume>44</volume>:<fpage>837</fpage>&#x02013;<lpage>45</lpage>. <pub-id pub-id-type="doi">10.2307/2531595</pub-id><pub-id pub-id-type="pmid">3203132</pub-id></citation></ref>
<ref id="B30">
<label>30.</label>
<citation citation-type="web"><person-group person-group-type="author"><name><surname>Ling</surname> <given-names>CX</given-names></name> <name><surname>Huang</surname> <given-names>J</given-names></name> <name><surname>Zhang</surname> <given-names>H</given-names></name></person-group>. <article-title>AUC: a better measure than accuracy in comparing learning algorithms</article-title>. In: <source>Conference of the Canadian Society for Computational Studies of Intelligence</source>. <publisher-loc>Berlin, Heidelberg</publisher-loc>: <publisher-name>Springer</publisher-name>\ (<year>2003</year>). p. <fpage>329</fpage>&#x02013;<lpage>41</lpage>. Available online at: <ext-link ext-link-type="uri" xlink:href="https://cran.r-project.org/web/packages/mgcv/mgcv.pdf">https://cran.r-project.org/web/packages/mgcv/mgcv.pdf</ext-link></citation>
</ref>
<ref id="B31">
<label>31.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wood</surname> <given-names>S</given-names></name> <name><surname>Wood</surname> <given-names>MS</given-names></name></person-group>. <article-title>Package &#x02018;mgcv&#x00027;</article-title>. R package version. (<year>2015</year>) <volume>1</volume>:<fpage>729</fpage>. </citation>
</ref>
<ref id="B32">
<label>32.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pedregosa</surname> <given-names>F</given-names></name> <name><surname>Varoquaux</surname> <given-names>G</given-names></name> <name><surname>Gramfort</surname> <given-names>A</given-names></name> <name><surname>Michel</surname> <given-names>V</given-names></name> <name><surname>Thirion</surname> <given-names>B</given-names></name> <name><surname>Grisel</surname> <given-names>O</given-names></name> <etal/></person-group>. <article-title>Scikit-learn: machine learning in Python</article-title>. <source>J Mach Learn Technol.</source> (<year>2011</year>) <volume>12</volume>:<fpage>2825</fpage>&#x02013;<lpage>30</lpage>.</citation>
</ref>
<ref id="B33">
<label>33.</label>
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Kluyver</surname> <given-names>T</given-names></name> <name><surname>Ragan-Kelley</surname> <given-names>B</given-names></name> <name><surname>P&#x000E9;rez</surname> <given-names>F</given-names></name> <name><surname>Granger</surname> <given-names>B</given-names></name> <name><surname>Bussonnier</surname> <given-names>M</given-names></name> <name><surname>Frederic</surname> <given-names>J</given-names></name> <etal/></person-group>. <article-title>Jupyter Notebooks &#x02013; a publishing format for reproducible computational workflows</article-title>. In: <source>Positioning and Power in Academic Publishing: Players, Agents and Agendas</source>. <publisher-loc>Amsterdam</publisher-loc>: <publisher-name>IOS Press</publisher-name> (<year>2016</year>). p. <fpage>87</fpage>&#x02013;<lpage>90</lpage>.</citation>
</ref>
<ref id="B34">
<label>34.</label>
<citation citation-type="web"><person-group person-group-type="author"><collab>Plotly Technologies Inc</collab></person-group>. <source>Plotly Visualization Library.</source> (<year>2015</year>). Available online at: <ext-link ext-link-type="uri" xlink:href="https://plot.ly">https://plot.ly</ext-link></citation>
</ref>
<ref id="B35">
<label>35.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tan</surname> <given-names>S</given-names></name> <name><surname>Wang</surname> <given-names>D</given-names></name> <name><surname>Liu</surname> <given-names>M</given-names></name> <name><surname>Zhang</surname> <given-names>S</given-names></name> <name><surname>Wu</surname> <given-names>B</given-names></name> <name><surname>Liu</surname> <given-names>B</given-names></name></person-group>. <article-title>Frequency and predictors of spontaneous hemorrhagic transformation in ischemic stroke and its association with prognosis</article-title>. <source>J Neurol.</source> (<year>2014</year>) <volume>261</volume>:<fpage>905</fpage>&#x02013;<lpage>12</lpage>. <pub-id pub-id-type="doi">10.1007/s00415-014-7297-8</pub-id><pub-id pub-id-type="pmid">24590407</pub-id></citation></ref>
<ref id="B36">
<label>36.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pande</surname> <given-names>SD</given-names></name> <name><surname>Win</surname> <given-names>MM</given-names></name> <name><surname>Khine</surname> <given-names>AA</given-names></name> <name><surname>Zaw</surname> <given-names>EM</given-names></name> <name><surname>Manoharraj</surname> <given-names>N</given-names></name> <name><surname>Lolong</surname> <given-names>L</given-names></name> <etal/></person-group>. <article-title>Haemorrhagic transformation following ischaemic stroke: a retrospective study</article-title>. <source>Sci Rep.</source> (<year>2020</year>) <volume>10</volume>:<fpage>1</fpage>&#x02013;<lpage>9</lpage>. <pub-id pub-id-type="doi">10.1038/s41598-020-62230-5</pub-id><pub-id pub-id-type="pmid">32210323</pub-id></citation></ref>
<ref id="B37">
<label>37.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Peng</surname> <given-names>H</given-names></name> <name><surname>Long</surname> <given-names>F</given-names></name> <name><surname>Ding</surname> <given-names>C</given-names></name></person-group>. <article-title>Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy</article-title>. <source>IEEE Trans Pattern Anal Mach Intell.</source> (<year>2005</year>) <volume>27</volume>:<fpage>1226</fpage>&#x02013;<lpage>38</lpage>. <pub-id pub-id-type="doi">10.1109/TPAMI.2005.159</pub-id><pub-id pub-id-type="pmid">16119262</pub-id></citation></ref>
<ref id="B38">
<label>38.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Terruso</surname> <given-names>V</given-names></name> <name><surname>D&#x00027;Amelio</surname> <given-names>M</given-names></name> <name><surname>Di Benedetto</surname> <given-names>N</given-names></name> <name><surname>Lupo</surname> <given-names>I</given-names></name> <name><surname>Saia</surname> <given-names>V</given-names></name> <name><surname>Famoso</surname> <given-names>G</given-names></name> <etal/></person-group>. <article-title>Frequency and determinants for hemorrhagic transformation of cerebral infarction</article-title>. <source>Neuroepidemiology.</source> (<year>2009</year>) <volume>33</volume>:<fpage>261</fpage>&#x02013;<lpage>5</lpage>. <pub-id pub-id-type="doi">10.1159/000229781</pub-id><pub-id pub-id-type="pmid">19641332</pub-id></citation></ref>
<ref id="B39">
<label>39.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pundik</surname> <given-names>S</given-names></name> <name><surname>McWilliams-Dunnigan</surname> <given-names>L</given-names></name> <name><surname>Blackham</surname> <given-names>KL</given-names></name> <name><surname>Kirchner</surname> <given-names>HL</given-names></name> <name><surname>Sundararajan</surname> <given-names>S</given-names></name> <name><surname>Sunshine</surname> <given-names>JL</given-names></name> <etal/></person-group>. <article-title>Older age does not increase risk of hemorrhagic complications after intravenous and/or intra-arterial thrombolysis for acute stroke</article-title>. <source>J Stroke Cerebrovasc Dis.</source> (<year>2008</year>) <volume>17</volume>:<fpage>266</fpage>&#x02013;<lpage>72</lpage>. <pub-id pub-id-type="doi">10.1016/j.jstrokecerebrovasdis.2008.03.003</pub-id><pub-id pub-id-type="pmid">18755405</pub-id></citation></ref>
<ref id="B40">
<label>40.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Saposnik</surname> <given-names>G</given-names></name> <name><surname>Guzik</surname> <given-names>AK</given-names></name> <name><surname>Reeves</surname> <given-names>M</given-names></name> <name><surname>Ovbiagele</surname> <given-names>B</given-names></name> <name><surname>Johnston</surname> <given-names>SC</given-names></name></person-group>. <article-title>Stroke prognostication using age and NIH Stroke Scale: SPAN-100</article-title>. <source>Neurology.</source> (<year>2013</year>) <volume>80</volume>:<fpage>21</fpage>&#x02013;<lpage>8</lpage>. <pub-id pub-id-type="doi">10.1212/WNL.0b013e31827b1ace</pub-id><pub-id pub-id-type="pmid">23918864</pub-id></citation></ref>
<ref id="B41">
<label>41.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kalinin</surname> <given-names>MN</given-names></name> <name><surname>Khasanova</surname> <given-names>DR</given-names></name> <name><surname>Ibatullin</surname> <given-names>MM</given-names></name></person-group>. <article-title>The hemorrhagic transformation index score: a prediction tool in middle cerebral artery ischemic stroke</article-title>. <source>BMC Neurol.</source> (<year>2017</year>) <volume>17</volume>:<fpage>177</fpage>. <pub-id pub-id-type="doi">10.1186/s12883-017-0958-3</pub-id><pub-id pub-id-type="pmid">28882130</pub-id></citation></ref>
<ref id="B42">
<label>42.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stone</surname> <given-names>JA</given-names></name> <name><surname>Willey</surname> <given-names>JZ</given-names></name> <name><surname>Keyrouz</surname> <given-names>S</given-names></name> <name><surname>Butera</surname> <given-names>J</given-names></name> <name><surname>McTaggart</surname> <given-names>RA</given-names></name> <name><surname>Cutting</surname> <given-names>S</given-names></name> <etal/></person-group>. <article-title>Therapies for hemorrhagic transformation in acute ischemic stroke</article-title>. <source>Curr Treat Options Neurol.</source> (<year>2017</year>) <volume>19</volume>:<fpage>1</fpage>. <pub-id pub-id-type="doi">10.1007/s11940-017-0438-5</pub-id><pub-id pub-id-type="pmid">33161301</pub-id></citation></ref>
<ref id="B43">
<label>43.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Charidimou</surname> <given-names>A</given-names></name> <name><surname>Tur</surname> <given-names>G</given-names></name> <name><surname>Oppenheim</surname> <given-names>C</given-names></name> <name><surname>Yan</surname> <given-names>S</given-names></name> <name><surname>Scheitz</surname> <given-names>JF</given-names></name> <name><surname>Erdur</surname> <given-names>H</given-names></name> <etal/></person-group>. <article-title>Microbleeds, cerebral hemorrhage, and functional outcome after stroke thrombolysis individual patient data meta-analysis</article-title>. <source>Stroke.</source> (<year>2017</year>) <volume>48</volume>:<fpage>2084</fpage>&#x02013;<lpage>90</lpage>. <pub-id pub-id-type="doi">10.1161/STROKEAHA.116.012992</pub-id><pub-id pub-id-type="pmid">29030477</pub-id></citation></ref>
<ref id="B44">
<label>44.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dar</surname> <given-names>NZ</given-names></name> <name><surname>Ain</surname> <given-names>QU</given-names></name> <name><surname>Nazir</surname> <given-names>R</given-names></name> <name><surname>Ahmad</surname> <given-names>A</given-names></name></person-group>. <article-title>Cerebral microbleeds in an acute ischemic stroke as a predictor of hemorrhagic transformation</article-title>. <source>Cureus</source>. (<year>2018</year>) <volume>10</volume>:<fpage>e3308</fpage>. <pub-id pub-id-type="doi">10.7759/cureus.3308</pub-id><pub-id pub-id-type="pmid">32175198</pub-id></citation></ref>
<ref id="B45">
<label>45.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Poels</surname> <given-names>MM</given-names></name> <name><surname>Vernooij</surname> <given-names>MW</given-names></name> <name><surname>Ikram</surname> <given-names>MA</given-names></name> <name><surname>Hofman</surname> <given-names>A</given-names></name> <name><surname>Krestin</surname> <given-names>GP</given-names></name> <name><surname>van der Lugt</surname> <given-names>A</given-names></name> <etal/></person-group>. <article-title>Prevalence and risk factors of cerebral microbleeds: an update of the Rotterdam scan study</article-title>. <source>Stroke.</source> (<year>2010</year>) <volume>41</volume>:<fpage>S103</fpage>&#x02013;<lpage>6</lpage>. <pub-id pub-id-type="doi">10.1161/STROKEAHA.110.595181</pub-id><pub-id pub-id-type="pmid">20876479</pub-id></citation></ref>
<ref id="B46">
<label>46.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wardlaw</surname> <given-names>JM</given-names></name> <name><surname>Smith</surname> <given-names>EE</given-names></name> <name><surname>Biessels</surname> <given-names>GJ</given-names></name> <name><surname>Cordonnier</surname> <given-names>C</given-names></name> <name><surname>Fazekas</surname> <given-names>F</given-names></name> <name><surname>Frayne</surname> <given-names>R</given-names></name> <etal/></person-group>. <article-title>Neuroimaging standards for research into small vessel disease and its contribution to ageing and neurodegeneration</article-title>. <source>Lancet Neurol</source>. (<year>2013</year>) <volume>12</volume>:<fpage>822</fpage>&#x02013;<lpage>38</lpage>. <pub-id pub-id-type="doi">10.1016/S1474-4422(13)70124-8</pub-id><pub-id pub-id-type="pmid">23867200</pub-id></citation></ref>
<ref id="B47">
<label>47.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lee</surname> <given-names>J</given-names></name> <name><surname>Sohn</surname> <given-names>EH</given-names></name> <name><surname>Oh</surname> <given-names>E</given-names></name> <name><surname>Lee</surname> <given-names>AY</given-names></name></person-group>. <article-title>Characteristics of cerebral microbleeds</article-title>. <source>Dement Neurocogn Disord.</source> (<year>2018</year>) <volume>17</volume>:<fpage>73</fpage>&#x02013;<lpage>82</lpage>. <pub-id pub-id-type="doi">10.12779/dnd.2018.17.3.73</pub-id><pub-id pub-id-type="pmid">30906396</pub-id></citation></ref>
<ref id="B48">
<label>48.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ryu</surname> <given-names>WS</given-names></name> <name><surname>Jeong</surname> <given-names>SW</given-names></name> <name><surname>Kim</surname> <given-names>DE</given-names></name></person-group>. <article-title>Total small vessel disease burden and functional outcome in patients with ischemic stroke</article-title>. <source>PLoS ONE.</source> (<year>2020</year>) <volume>15</volume>:<fpage>e0242319</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0242319</pub-id><pub-id pub-id-type="pmid">33180837</pub-id></citation></ref>
<ref id="B49">
<label>49.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Charidimou</surname> <given-names>A</given-names></name> <name><surname>Werring</surname> <given-names>DJ</given-names></name></person-group>. <article-title>Cerebral microbleeds: detection, mechanisms and clinical challenges</article-title>. <source>Fut Neurol</source>. (<year>2011</year>) <volume>6</volume>:<fpage>587</fpage>&#x02013;<lpage>611</lpage>. <pub-id pub-id-type="doi">10.2217/fnl.11.42</pub-id></citation>
</ref>
<ref id="B50">
<label>50.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Appelros</surname> <given-names>P</given-names></name> <name><surname>Nydevik</surname> <given-names>I</given-names></name> <name><surname>Seiger</surname> <given-names>A</given-names></name> <name><surname>Ter&#x000E9;nt</surname> <given-names>A</given-names></name></person-group>. <article-title>Predictors of severe stroke influence of preexisting dementia and cardiac disorders</article-title>. <source>Stroke.</source> (<year>2002</year>) <volume>33</volume>:<fpage>2357</fpage>&#x02013;<lpage>62</lpage>. <pub-id pub-id-type="doi">10.1161/01.STR.0000030318.99727.FA</pub-id><pub-id pub-id-type="pmid">12364721</pub-id></citation></ref>
<ref id="B51">
<label>51.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Corso</surname> <given-names>G</given-names></name> <name><surname>Bottacchi</surname> <given-names>E</given-names></name> <name><surname>Tosi</surname> <given-names>P</given-names></name> <name><surname>Caligiana</surname> <given-names>L</given-names></name> <name><surname>Lia</surname> <given-names>C</given-names></name> <name><surname>Veronese Morosini</surname> <given-names>M</given-names></name> <etal/></person-group>. <article-title>Outcome predictors in first-ever ischemic stroke patients: a population-based study</article-title>. <source>Int Sch Res Notices.</source> (<year>2014</year>) <volume>2014</volume>:<fpage>904647</fpage>. <pub-id pub-id-type="doi">10.1155/2014/904647</pub-id><pub-id pub-id-type="pmid">27437502</pub-id></citation></ref>
<ref id="B52">
<label>52.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lee</surname> <given-names>JY</given-names></name> <name><surname>Sunwoo</surname> <given-names>JS</given-names></name> <name><surname>Kwon</surname> <given-names>KY</given-names></name> <name><surname>Roh</surname> <given-names>H</given-names></name> <name><surname>Ahn</surname> <given-names>MY</given-names></name> <name><surname>Lee</surname> <given-names>MH</given-names></name> <etal/></person-group>. <article-title>Left ventricular ejection fraction predicts post stroke cardiovascular events and mortality in patients without atrial fibrillation and coronary heart disease</article-title>. <source>Korean Circ J</source>. (<year>2018</year>) <volume>48</volume>:<fpage>1148</fpage>&#x02013;<lpage>56</lpage>. <pub-id pub-id-type="doi">10.4070/kcj.2018.0115</pub-id><pub-id pub-id-type="pmid">30403019</pub-id></citation></ref>
<ref id="B53">
<label>53.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lin</surname> <given-names>L</given-names></name> <name><surname>Chen</surname> <given-names>C</given-names></name> <name><surname>Tian</surname> <given-names>H</given-names></name> <name><surname>Bivard</surname> <given-names>A</given-names></name> <name><surname>Spratt</surname> <given-names>N</given-names></name> <name><surname>Levi</surname> <given-names>CR</given-names></name> <etal/></person-group>. <article-title>Perfusion computed tomography accurately quantifies collateral flow after acute ischemic stroke</article-title>. <source>Stroke.</source> (<year>2020</year>) <volume>51</volume>:<fpage>1006</fpage>&#x02013;<lpage>9</lpage>. <pub-id pub-id-type="doi">10.1161/STROKEAHA.119.028284</pub-id><pub-id pub-id-type="pmid">31948385</pub-id></citation></ref>
<ref id="B54">
<label>54.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Castro</surname> <given-names>P</given-names></name> <name><surname>Serrador</surname> <given-names>JM</given-names></name> <name><surname>Rocha</surname> <given-names>I</given-names></name> <name><surname>Sorond</surname> <given-names>F</given-names></name> <name><surname>Azevedo</surname> <given-names>E</given-names></name></person-group>. <article-title>Efficacy of cerebral autoregulation in early ischemic stroke predicts smaller infarcts and better outcome</article-title>. <source>Front Neurol</source>. (<year>2017</year>) <volume>8</volume>:<fpage>113</fpage>. <pub-id pub-id-type="doi">10.3389/fneur.2017.00113</pub-id><pub-id pub-id-type="pmid">28392777</pub-id></citation></ref>
<ref id="B55">
<label>55.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Silverman</surname> <given-names>A</given-names></name> <name><surname>Kodali</surname> <given-names>S</given-names></name> <name><surname>Sheth</surname> <given-names>KN</given-names></name> <name><surname>Petersen</surname> <given-names>NH</given-names></name></person-group>. <article-title>Hemodynamics and hemorrhagic transformation after endovascular therapy for ischemic stroke</article-title>. <source>Front Neurol.</source> (<year>2020</year>) <volume>11</volume>:<fpage>728</fpage>. <pub-id pub-id-type="doi">10.3389/fneur.2020.00728</pub-id><pub-id pub-id-type="pmid">32765416</pub-id></citation></ref>
<ref id="B56">
<label>56.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ingala</surname> <given-names>S</given-names></name> <name><surname>Mazzai</surname> <given-names>L</given-names></name> <name><surname>Sudre</surname> <given-names>CH</given-names></name> <name><surname>Salvad&#x000F3;</surname> <given-names>G</given-names></name> <name><surname>Brugulat-Serrat</surname> <given-names>A</given-names></name> <name><surname>Wottschel</surname> <given-names>V</given-names></name> <etal/></person-group>. <article-title>The relation between APOE genotype and cerebral microbleeds in cognitively unimpaired middle- and old-aged individuals</article-title>. <source>Neurobiol Aging.</source> (<year>2020</year>) <volume>95</volume>:<fpage>104</fpage>&#x02013;<lpage>14</lpage>. <pub-id pub-id-type="doi">10.1016/j.neurobiolaging.2020.06.015</pub-id><pub-id pub-id-type="pmid">32791423</pub-id></citation></ref>
<ref id="B57">
<label>57.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bahnasy</surname> <given-names>WS</given-names></name> <name><surname>Ragab</surname> <given-names>OA</given-names></name> <name><surname>Elhassanien</surname> <given-names>ME</given-names></name></person-group>. <article-title>Stroke onset to needle delay: Where these golden hours are lost? An Egyptian center experience</article-title>. <source>Eneurologicalsci.</source> (<year>2019</year>) <volume>14</volume>:<fpage>68</fpage>&#x02013;<lpage>71</lpage>. <pub-id pub-id-type="doi">10.1016/j.ensci.2019.01.003</pub-id><pub-id pub-id-type="pmid">30671551</pub-id></citation></ref>
</ref-list> 
</back>
</article>
