<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Neurol.</journal-id>
<journal-title>Frontiers in Neurology</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Neurol.</abbrev-journal-title>
<issn pub-type="epub">1664-2295</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fneur.2022.1014346</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Neurology</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Comparison of ischemic stroke diagnosis models based on machine learning</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Yang</surname> <given-names>Wan-Xia</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="author-notes" rid="fn002"><sup>&#x02020;</sup></xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Wang</surname> <given-names>Fang-Fang</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="author-notes" rid="fn002"><sup>&#x02020;</sup></xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Pan</surname> <given-names>Yun-Yan</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Xie</surname> <given-names>Jian-Qin</given-names></name>
<xref ref-type="aff" rid="aff2"><sup>2</sup></xref>
</contrib>
<contrib contrib-type="author">
<name><surname>Lu</surname> <given-names>Ming-Hua</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>You</surname> <given-names>Chong-Ge</given-names></name>
<xref ref-type="aff" rid="aff1"><sup>1</sup></xref>
<xref ref-type="corresp" rid="c001"><sup>&#x0002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/1756240/overview"/>
</contrib>
</contrib-group>
<aff id="aff1"><sup>1</sup><institution>Laboratory Medicine Center, Lanzhou University Second Hospital</institution>, <addr-line>Lanzhou</addr-line>, <country>China</country></aff>
<aff id="aff2"><sup>2</sup><institution>Anesthesiology Department, Lanzhou University Second Hospital</institution>, <addr-line>Lanzhou</addr-line>, <country>China</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Hari Kishan Reddy Indupuru, University of Texas Health Science Center at Houston, United States</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Anwar P. P. Abdul Majeed, Xi&#x00027;an Jiaotong-Liverpool University, China; Haoyue Zhang, University of California, Los Angeles, United States; Ren Shenghan, Xidian University, China</p></fn>
<corresp id="c001">&#x0002A;Correspondence: Chong-Ge You <email>youchg&#x00040;lzu.edu.cn</email></corresp>
<fn fn-type="other" id="fn001"><p>This article was submitted to Stroke, a section of the journal Frontiers in Neurology</p></fn>
<fn fn-type="equal" id="fn002"><p>&#x02020;These authors have contributed equally to this work and share first authorship</p></fn></author-notes>
<pub-date pub-type="epub">
<day>05</day>
<month>12</month>
<year>2022</year>
</pub-date>
<pub-date pub-type="collection">
<year>2022</year>
</pub-date>
<volume>13</volume>
<elocation-id>1014346</elocation-id>
<history>
<date date-type="received">
<day>08</day>
<month>08</month>
<year>2022</year>
</date>
<date date-type="accepted">
<day>09</day>
<month>11</month>
<year>2022</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2022 Yang, Wang, Pan, Xie, Lu and You.</copyright-statement>
<copyright-year>2022</copyright-year>
<copyright-holder>Yang, Wang, Pan, Xie, Lu and You</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<abstract>
<sec>
<title>Background</title>
<p>The incidence, prevalence, and mortality of ischemic stroke (IS) continue to rise, resulting in a serious global disease burden. The prediction models have a great value in the early prediction and diagnosis of IS.</p>
</sec>
<sec>
<title>Methods</title>
<p>The R software was used to screen the differentially expressed genes (DEGs) of IS and control samples in the datasets GSE16561, GSE58294, and GSE37587 and analyze DEGs for enrichment analysis. The feature genes of IS were obtained by several machine learning algorithms, including the least absolute shrinkage and selector operation (LASSO) logistic regression, the support vector machine-recursive feature elimination (SVM-RFE), and the Random Forest (RF). The IS diagnostic models were constructed based on transcriptomics by machine learning and artificial neural network (ANN).</p>
</sec>
<sec>
<title>Results</title>
<p>A total of 69 DEGs, mainly involved in immune and inflammatory responses, were identified. The pathways enriched in the IS group were complement and coagulation cascades, lysosome, PPAR signaling pathway, regulation of autophagy, and toll-like receptor signaling pathway. The feature genes selected by LASSO, SVM-RFE, and RF were 17, 10, and 12, respectively. The area under the curve (AUC) of the LASSO model in the training dataset, GSE22255, and GSE195442 was 0.969, 0.890, and 1.000. The AUC of the SVM-RFE model was 0.957, 0.805, and 1.000, respectively. The AUC of the RF model was 0.947, 0.935, and 1.000, respectively. The models have good sensitivity, specificity, and accuracy. The AUC of the LASSO&#x0002B;ANN, SVM-RFE&#x0002B;ANN, and RF&#x0002B;ANN models was 1.000, 0.995, and 0.997, respectively, in the training dataset. However, the AUC of LASSO&#x0002B;ANN, SVM-RFE&#x0002B;ANN, and RF&#x0002B;ANN models was 0.688, 0.605, and 0.619, respectively, in the GSE22255 dataset. The AUC of the LASSO&#x0002B;ANN and RF&#x0002B;ANN models was 0.740 and 0.630, respectively, in the GSE195442 dataset. In the training dataset, the sensitivity, specificity, and accuracy of the LASSO&#x0002B;ANN model were 1.000, 1.000, and 1.000, respectively; of the SVM-RFE&#x0002B;ANN model were 0.946, 0.982, and 0.964, respectively; and of the RF&#x0002B;ANN model were 0.964, 1.000, and 0.982, respectively. In the test datasets, the sensitivity was very satisfactory; however, the specificity and accuracy were not good.</p>
</sec>
<sec>
<title>Conclusion</title>
<p>The LASSO, SVM-RFE, and RF models have good prediction abilities. However, the ANN model is efficient at classifying positive samples and is unsuitable at classifying negative samples.</p>
</sec></abstract>
<kwd-group>
<kwd>ischemic stroke</kwd>
<kwd>machine learning</kwd>
<kwd>artificial neural network</kwd>
<kwd>diagnostic model</kwd>
<kwd>transcriptomics</kwd>
</kwd-group>
<counts>
<fig-count count="6"/>
<table-count count="3"/>
<equation-count count="0"/>
<ref-count count="37"/>
<page-count count="11"/>
<word-count count="5453"/>
</counts>
</article-meta>
</front>
<body>
<sec sec-type="intro" id="s1">
<title>Introduction</title>
<p>The Global Burden of Diseases, Injuries, and Risk Factors Study (GBD) showed that there were 12.2 million incident cases of stroke, 101 million prevalent cases of stroke, and 6.55 million deaths from stroke in 2019 (<xref ref-type="bibr" rid="B1">1</xref>). Globally, the incidence and mortality of stroke are on the rise, and stroke remains the second leading cause of death (<xref ref-type="bibr" rid="B2">2</xref>). Especially in China, cerebrovascular disease is the first cause of death, and the lifetime risk of stroke in the Chinese population ranks first in the world (<xref ref-type="bibr" rid="B3">3</xref>). In 2019, there were 3.94 million new stroke cases, 2.19 million deaths from stroke, and 28.76 million prevalent cases of stroke, of which ischemic stroke (IS) accounted for 84.1% in China (<xref ref-type="bibr" rid="B4">4</xref>).</p>
<p>The etiology and pathogenesis of IS are not fully understood. According to epidemiological investigations, IS may be associated with hypertension, high BMI, hyperglycemia, environmental particulate matter pollution, and smoking (<xref ref-type="bibr" rid="B1">1</xref>, <xref ref-type="bibr" rid="B5">5</xref>). As modern medicine tends to be individualized, prevention and treatment strategies based on patient genetic information have always been ideal treatment methods for medical practitioners. Studies (<xref ref-type="bibr" rid="B6">6</xref>) have found that genetic factors also play a very important role in the occurrence of IS. At present, more and more studies believe that the occurrence and poor prognosis of IS are related to the abnormal expression of genes (<xref ref-type="bibr" rid="B7">7</xref>). However, multiple genes are often involved in the occurrence of IS. This inspired us to explore diagnostic and prognostic methods for IS by using multiple disease-specific genes.</p>
<p>At present, there are some limitations to the IS diagnostic techniques commonly used in clinical practice. The diagnosis of IS mainly relies on typical clinical symptoms and brain imaging (<xref ref-type="bibr" rid="B8">8</xref>), while approximately 50% of early IS diagnoses lack specificity in imaging (<xref ref-type="bibr" rid="B9">9</xref>). In addition, most patients are irreversible by the time the diagnosis is confirmed, resulting in a poor prognosis. Although scholars have done a great deal of work in finding biomarkers for IS diagnosis or prognosis, few biomarkers are available in clinical practice (<xref ref-type="bibr" rid="B10">10</xref>). Existing predictive models are mostly based on demographic data and clinical parameters, which may have a high risk of bias and fail to make reliable clinical decisions (<xref ref-type="bibr" rid="B11">11</xref>). Machine learning research is developing rapidly and has become one of the important topics in the field of artificial intelligence. At present, machine learning has become a research hotspot in the field of medical and health data mining (<xref ref-type="bibr" rid="B12">12</xref>). Machine learning algorithms such as the least absolute shrinkage and selector operation (LASSO), support vector machine-recursive feature elimination (SVM-RFE), Random Forest (RF), and the neural network have been proven to be of great value in diagnosing stroke (<xref ref-type="bibr" rid="B13">13</xref>&#x02013;<xref ref-type="bibr" rid="B15">15</xref>).</p>
<p>In this study, we screened differentially expressed genes (DEGs) between IS and control samples in the Gene Expression Omnibus (GEO) database; used LASSO, SVM-RFE, and RF to screen out IS feature genes; and constructed a disease diagnosis model of IS to evaluate the performance of different models on predicting IS.</p>
</sec>
<sec sec-type="methods" id="s2">
<title>Methods</title>
<sec>
<title>Microarray data and processing</title>
<p>The expression profile data and corresponding platform annotation information of microarray datasets, such as GSE16561, GSE58294, GSE37587, GSE22255, and GSE195442, were downloaded from the GEO database (<ext-link ext-link-type="uri" xlink:href="https://www.ncbi.nlm.nih.gov/geo/">https://www.ncbi.nlm.nih.gov/geo/</ext-link>). GSE16561, GSE58294, and GSE37587 were integrated as training datasets, and GSE22255 and GSE195442 were used as test datasets, as shown in <xref ref-type="table" rid="T1">Table 1</xref>. The R software (version 4.1.0) was used to transform the probe names of GSE16561, GSE58294, GSE37587, GSE22255, and GSE195442 matrix data into gene names. After the integration of the GSE16561, GSE58294, and GSE37587 datasets, the data were normalized by log2 transformation for data with large values and averaging for repeated probes. The &#x0201C;sva&#x0201D; package was used to calibrate batch effects. The principal component analysis (PCA) diagram before and after calibration was drawn using the ggplot2 package. Since there are 47 control samples and 176 IS samples in the integrated training dataset, there is a class imbalance. We used the SMOTE algorithm (<xref ref-type="bibr" rid="B16">16</xref>) to adjust for class imbalance. The R software &#x0201C;UBL&#x0201D; package was used.</p>
<table-wrap position="float" id="T1">
<label>Table 1</label>
<caption><p>Ischemic stroke datasets from the GEO database.</p></caption>
<table frame="hsides" rules="groups">
<thead><tr>
<th valign="top" align="left"><bold>Group</bold></th>
<th valign="top" align="left"><bold>Dataset</bold></th>
<th valign="top" align="left"><bold>Reference</bold></th>
<th valign="top" align="left"><bold>Data type</bold></th>
<th valign="top" align="left"><bold>Platform</bold></th>
<th valign="top" align="center"><bold>Stroke</bold></th>
<th valign="top" align="center"><bold>Control</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Training dataset</td>
<td valign="top" align="left">GSE16561</td>
<td valign="top" align="left">Barr (<xref ref-type="bibr" rid="B17">17</xref>)</td>
<td valign="top" align="left">Microarray</td>
<td valign="top" align="left">GPL6883</td>
<td valign="top" align="center">39</td>
<td valign="top" align="center">24</td>
</tr>
<tr>
<td valign="top" align="left">Training dataset</td>
<td valign="top" align="left">GSE58294</td>
<td valign="top" align="left">Stamova (<xref ref-type="bibr" rid="B18">18</xref>)</td>
<td valign="top" align="left">Microarray</td>
<td valign="top" align="left">GPL570</td>
<td valign="top" align="center">69</td>
<td valign="top" align="center">23</td>
</tr>
<tr>
<td valign="top" align="left">Training dataset</td>
<td valign="top" align="left">GSE37587</td>
<td valign="top" align="left">Barr (<xref ref-type="bibr" rid="B19">19</xref>)</td>
<td valign="top" align="left">Microarray</td>
<td valign="top" align="left">GPL6883</td>
<td valign="top" align="center">68</td>
<td valign="top" align="center">0</td>
</tr>
<tr>
<td valign="top" align="left">Test dataset</td>
<td valign="top" align="left">GSE22255</td>
<td valign="top" align="left">Krug (<xref ref-type="bibr" rid="B20">20</xref>)</td>
<td valign="top" align="left">Microarray</td>
<td valign="top" align="left">GPL570</td>
<td valign="top" align="center">20</td>
<td valign="top" align="center">20</td>
</tr>
<tr>
<td valign="top" align="left">Test dataset</td>
<td valign="top" align="left">GSE195442</td>
<td valign="top" align="left">Yang (<xref ref-type="bibr" rid="B21">21</xref>)</td>
<td valign="top" align="left">Microarray</td>
<td valign="top" align="left">GPL31275</td>
<td valign="top" align="center">10</td>
<td valign="top" align="center">10</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
<sec>
<title>Screening for differentially expressed genes (DEGs)</title>
<p>The &#x0201C;limma&#x0201D; package was used to screen DEGs of the integrative data of GSE16561, GSE58294, and GSE37587. The screening criteria were set as |log<sub>2</sub>FC| &#x0003E; 0.6 and the adjusted <italic>P-</italic>value was &#x0003C;0.05. The heatmap and volcano plot of DEGs were drawn using the &#x0201C;pheatmap&#x0201D; and &#x0201C;ggplot2&#x0201D; packages, respectively.</p>
</sec>
<sec>
<title>Enrichment analysis</title>
<p>To understand the functions of DEGs, we used the R software &#x0201C;clusterProfiler&#x0201D; package to conduct a Gene Ontology (GO) enrichment analysis and a Gene Set Enrichment Analysis (GSEA) on DEGs. An adjusted <italic>P</italic>-value of &#x0003C;0.05 was considered statistically significant. GO enrichment analysis includes a biological process (BP), a cellular component (CC), and a molecular function (MF).</p>
</sec>
<sec>
<title>Feature selection and model evaluation</title>
<p>To screen out the feature genes of IS, the R was used to perform machine learning analysis on DEGs. The &#x0201C;glmnet&#x0201D; package was used to construct the LASSO model with penalty parameter tuning conducted by ten-fold cross-validation. The response type was set as binomial, and the alpha was set as 1. We selected the feature genes with the minimum error. Besides, the &#x0201C;e1071&#x0201D; package was used to establish the SVM-RFE model to screen out the genes with the minimum cross-validation error. <italic>k</italic> = 10 was the setting for the <italic>k</italic>-fold cross-validation, and the parameter of halving above was identified as 50. The &#x0201C;randomForest&#x0201D; package was used to establish the RF model. The RF model was established to find out the number of random forest trees with the minimum error. We selected 272 trees as the parameter of the random forest model. The &#x0201C;pROC&#x0201D; software package was used to draw the receiver operating characteristic (ROC) curve to validate the accuracy of the model. The dimensionality importance value of the RF model was obtained using the decreasing accuracy method (Gini coefficient method). The performance of prediction models generated by machine learning classifiers was assessed using classification sensitivity, specificity, and the area under the curve (AUC).</p>
</sec>
<sec>
<title>Construction and validation of the ANN model</title>
<p>To build and evaluate the performance of the artificial neural network (ANN) model, we performed gene scoring for feature genes, and the scoring rule was set as follows: if the expression of upregulated genes was greater than the median value, the score was 1; otherwise, the score was 0. If the expression of downregulated genes was greater than the median value, the score was 0; otherwise, the score was 1. The R software package &#x0201C;neuralnet&#x0201D; was used to construct the ANN model of feature genes according to the gene score. We set the hidden layer of the LASSO&#x0002B;ANN, SVM-RFE&#x0002B;ANN, and RF&#x0002B;ANN models as 1. The number of neurons in the hidden layers of the LASSO&#x0002B;ANN, SVM-RFE&#x0002B;ANN, and RF&#x0002B;ANN models was set as 8, 5, and 6, respectively. The activation function &#x0201C;logistic&#x0201D; was used. The IS disease classification model was constructed using the obtained gene weight information.</p>
</sec>
</sec>
<sec sec-type="results" id="s3">
<title>Results</title>
<sec>
<title>Batch calibration and SMOTE algorithm</title>
<p>The GSE16561, GSE58294, and GSE37587 datasets were integrated. To reduce the differences between batches, batch calibration was performed on the two datasets, and PCA was used to verify the effect of data calibration (<xref ref-type="fig" rid="F1">Figures 1A,B</xref>). The class distribution in the integrated dataset is not equal, which is prone to class imbalance. Training classification algorithms with imbalanced data provide inefficient prediction models, which may perform poor classification on a smaller number of samples. Hence, we used SMOTE to fix class imbalance (<xref ref-type="fig" rid="F1">Figures 1C,D</xref>).</p>
<fig id="F1" position="float">
<label>Figure 1</label>
<caption><p>PCA diagram. <bold>(A)</bold> PCA diagram of GSE16561, GSE58294, and GSE37587 datasets before calibration. <bold>(B)</bold> PCA diagram of GSE16561, GSE58294, and GSE37587 datasets after calibration. <bold>(C)</bold> PCA diagram of class distribution before SMOTE. <bold>(D)</bold> PCA diagram of class distribution after SMOTE.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fneur-13-1014346-g0001.tif"/>
</fig>
</sec>
<sec>
<title>Differential gene analysis</title>
<p>To identify the DEGs from IS and control samples, we conducted a Bayesian test on the training dataset and obtained a total of 69 DEGs, of which 46 were upregulated and 23 were downregulated (<xref ref-type="fig" rid="F2">Figures 2A,B</xref>).</p>
<fig id="F2" position="float">
<label>Figure 2</label>
<caption><p>The DEGs between ischemic stroke and control group in the GSE16561, GSE58294, and GSE37587 datasets. <bold>(A)</bold> Heatmap of DEGs. The red and blue represent the significantly upregulated and downregulated DEGs. <bold>(B)</bold> Volcano plot of DEGs. These genes consist of 46 upregulated genes and 23 downregulated genes. The screening criteria were set as |log<sub>2</sub>FC|&#x0003E; 0.6 and adjusted <italic>P</italic>- value of &#x0003C; 0.05.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fneur-13-1014346-g0002.tif"/>
</fig>
</sec>
<sec>
<title>Function and pathway enrichment analysis</title>
<p>The R software was used to perform enrichment analysis on 69 DEGs, as shown in <xref ref-type="fig" rid="F3">Figure 3</xref>. DEGs were mainly enriched in the immune response and the inflammatory response. The biological process involved immune response-regulating signaling, negative regulation of cytokine production, and negative regulation of immune response. The cellular component mainly focused on some granule lumens and granule membranes. The molecular function analysis showed that most of the genes were involved in immune receptor activity, serine-type peptidase activity, serine hydrolase activity, pattern recognition receptor activity, and cytokine receptor activity (<xref ref-type="fig" rid="F3">Figure 3A</xref>).</p>
<fig id="F3" position="float">
<label>Figure 3</label>
<caption><p>Function enrichment analysis. <bold>(A)</bold> GO enrichment analysis of DEGs. The size of the circle indicates the number of genes. The screening criterion was set as adjusted <italic>P</italic> &#x0003C; 0.05. <bold>(B,C)</bold> Enrichment plots from GSEA analysis in the control group and IS group.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fneur-13-1014346-g0003.tif"/>
</fig>
<p>The GSEA analysis indicated that the most enriched pathways in the control group were allograft rejection, antigen processing and presentation, primary immunodeficiency, ribosome, and spliceosome (<xref ref-type="fig" rid="F3">Figure 3B</xref>). In contrast, complement and coagulation cascades, lysosome, PPAR signaling pathway, regulation of autophagy, and toll-like receptor (TLR) signaling pathway were enriched in the IS group (<xref ref-type="fig" rid="F3">Figure 3C</xref>).</p>
</sec>
<sec>
<title>Screening for feature genes <italic>via</italic> machine learning</title>
<p>We used R software to perform machine learning analysis on 69 DEGs. The feature genes selected by LASSO (<xref ref-type="fig" rid="F4">Figures 4A,B</xref>) and SVM-RFE (<xref ref-type="fig" rid="F4">Figures 4C,D</xref>) were 17 and 10, respectively. The number of random forest trees with the minimum error of the RF model was 272 (<xref ref-type="fig" rid="F4">Figure 4E</xref>). The 12 genes with an importance value &#x0003E;3 were selected as disease-specific genes (<xref ref-type="fig" rid="F4">Figure 4F</xref>). The feature genes screened by the algorithms are shown in <xref ref-type="table" rid="T2">Table 2</xref>.</p>
<fig id="F4" position="float">
<label>Figure 4</label>
<caption><p>Screening for feature genes. <bold>(A)</bold> Identification of the optimal penalization coefficient lambda (&#x003BB;) in the LASSO model. <bold>(B)</bold> Cross-validation for tuning parameter selection in the LASSO model. <bold>(C,D)</bold> A plot of genes selection <italic>via</italic> SVM-RFE algorithm. <bold>(E)</bold> The influence of the number of decision trees on the error rate. The <italic>x-</italic>axis represents the number of decision trees, and the <italic>y</italic>-axis indicates the error rate. <bold>(F)</bold> Results of the Gini coefficient method in RF model. The <italic>x</italic>-axis indicates the genetic variable, and the <italic>y</italic>-axis represents the importance index.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fneur-13-1014346-g0004.tif"/>
</fig>
<table-wrap position="float" id="T2">
<label>Table 2</label>
<caption><p>Feature genes screened by machine learning algorithms.</p></caption>
<table frame="hsides" rules="groups">
<thead><tr>
<th valign="top" align="left"><bold>Algorithms</bold></th>
<th valign="top" align="left"><bold>Genes</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">LASSO</td>
<td valign="top" align="left">CPD, CLEC4D, CD163, CD19, ANKRD22, CD79B, HIST1H4D, HIST1H4H, TIMM8A, CLIC3, HTRA1, MAOA, LY96, PRSS33, FCGR3B, METTL7B, FOLR3</td>
</tr>
<tr>
<td valign="top" align="left">SVM-RFE</td>
<td valign="top" align="left">CLEC4D, ZNF439, PGLYRP1, HECW2, FAIM3, ANKRD22, CD79A, EVL, LY96, CD72</td>
</tr>
<tr>
<td valign="top" align="left">RF</td>
<td valign="top" align="left">ID3, EVL, FLT3LG, CPD, CD163, S100A12, SRPK1, KCNJ15, SLC22A4, ARG1, HECW2, CD19</td>
</tr>
</tbody>
</table>
</table-wrap>
</sec>
<sec>
<title>Effectiveness of machine learning models</title>
<p>To evaluate the prediction performance of the machine learning model, we first constructed the model by LASSO, SVM-RFE, and RF. In the training dataset and GSE22255 and GSE195442 test datasets, the AUC of the LASSO model was 0.969, 0.890, and 1.000, respectively (<xref ref-type="fig" rid="F5">Figures 5A&#x02013;C</xref>); the AUC of the SVM-RFE model was 0.957, 0.805, 1.000 (<xref ref-type="fig" rid="F5">Figures 5D&#x02013;F</xref>), respectively, and the AUC of the RF model was 0.947, 0.935, 1.000 (<xref ref-type="fig" rid="F5">Figures 5G&#x02013;I</xref>), respectively. In addition, the models have good sensitivity and specificity (<xref ref-type="table" rid="T3">Table 3</xref>).</p>
<fig id="F5" position="float">
<label>Figure 5</label>
<caption><p>Model accuracy evaluation. <bold>(A&#x02013;C)</bold> The ROC curves for using LASSO to estimate accuracy in training, GSE22255, and GSE195442 datasets. <bold>(D&#x02013;F)</bold> The ROC curves for using SVM-RFE to estimate accuracy in training, GSE22255, and GSE195442 datasets. <bold>(G&#x02013;I)</bold> The ROC curves for using RF to estimate accuracy in training, GSE22255, and GSE195442 datasets.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fneur-13-1014346-g0005.tif"/>
</fig>
<table-wrap position="float" id="T3">
<label>Table 3</label>
<caption><p>Comparison of ischemic stroke diagnosis models based on machine learning.</p></caption>
<table frame="hsides" rules="groups">
<thead><tr>
<th valign="top" align="left"><bold>Models</bold></th>
<th valign="top" align="left"><bold>Datasets</bold></th>
<th valign="top" align="center"><bold>AUC</bold></th>
<th valign="top" align="center"><bold>Sensitivity</bold></th>
<th valign="top" align="center"><bold>Specificity</bold></th>
<th valign="top" align="center"><bold>Accuracy</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">LASSO</td>
<td valign="top" align="left">Train</td>
<td valign="top" align="center">0.969 (0.942&#x02013;0.989)</td>
<td valign="top" align="center">0.929</td>
<td valign="top" align="center">0.946</td>
<td valign="top" align="center">0.938</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">GSE22255</td>
<td valign="top" align="center">0.890 (0.768&#x02013;0.975)</td>
<td valign="top" align="center">0.850</td>
<td valign="top" align="center">0.850</td>
<td valign="top" align="center">0.850</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">GSE195442</td>
<td valign="top" align="center">1.000 (1.000&#x02013;1.000)</td>
<td valign="top" align="center">1.000</td>
<td valign="top" align="center">1.000</td>
<td valign="top" align="center">1.000</td>
</tr>
<tr>
<td valign="top" align="left">SVM-RFE</td>
<td valign="top" align="left">Train</td>
<td valign="top" align="center">0.957 (0.930&#x02013;0.979)</td>
<td valign="top" align="center">0.857</td>
<td valign="top" align="center">0.946</td>
<td valign="top" align="center">0.902</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">GSE22255</td>
<td valign="top" align="center">0.805 (0.650&#x02013;0.922)</td>
<td valign="top" align="center">0.950</td>
<td valign="top" align="center">0.550</td>
<td valign="top" align="center">0.750</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">GSE195442</td>
<td valign="top" align="center">1.000 (1.000&#x02013;1.000)</td>
<td valign="top" align="center">1.000</td>
<td valign="top" align="center">1.000</td>
<td valign="top" align="center">1.000</td>
</tr>
<tr>
<td valign="top" align="left">RF</td>
<td valign="top" align="left">Train</td>
<td valign="top" align="center">0.947 (0.907&#x02013;0.980)</td>
<td valign="top" align="center">0.893</td>
<td valign="top" align="center">0.982</td>
<td valign="top" align="center">0.938</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">GSE22255</td>
<td valign="top" align="center">0.935 (0.855&#x02013;0.988)</td>
<td valign="top" align="center">0.817</td>
<td valign="top" align="center">0.883</td>
<td valign="top" align="center">0.850</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">GSE195442</td>
<td valign="top" align="center">1.000 (1.000&#x02013;1.000)</td>
<td valign="top" align="center">1.000</td>
<td valign="top" align="center">1.000</td>
<td valign="top" align="center">1.000</td>
</tr>
<tr>
<td valign="top" align="left">LASSO&#x0002B;SVM-RFE</td>
<td valign="top" align="left">Train</td>
<td valign="top" align="center">0.898 (0.853&#x02013;0.934)</td>
<td valign="top" align="center">0.777</td>
<td valign="top" align="center">0.866</td>
<td valign="top" align="center">0.822</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">GSE22255</td>
<td valign="top" align="center">0.692 (0.522&#x02013;0.840)</td>
<td valign="top" align="center">0.617</td>
<td valign="top" align="center">0.683</td>
<td valign="top" align="center">0.650</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">GSE195442</td>
<td valign="top" align="center">0.920 (0.730&#x02013;1.000)</td>
<td valign="top" align="center">1.000</td>
<td valign="top" align="center">0.900</td>
<td valign="top" align="center">0.950</td>
</tr>
<tr>
<td valign="top" align="left">SVM-RFE&#x0002B;RF</td>
<td valign="top" align="left">Train</td>
<td valign="top" align="center">0.899 (0.854&#x02013;0.939)</td>
<td valign="top" align="center">0.777</td>
<td valign="top" align="center">0.973</td>
<td valign="top" align="center">0.875</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">GSE22255</td>
<td valign="top" align="center">0.647 (0.473&#x02013;0.820)</td>
<td valign="top" align="center">0.850</td>
<td valign="top" align="center">0.500</td>
<td valign="top" align="center">0.675</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">GSE195442</td>
<td valign="top" align="center">0.850 (0.640&#x02013;1.000)</td>
<td valign="top" align="center">0.800</td>
<td valign="top" align="center">0.900</td>
<td valign="top" align="center">0.850</td>
</tr>
<tr>
<td valign="top" align="left">LASSO&#x0002B;ANN</td>
<td valign="top" align="left">Train</td>
<td valign="top" align="center">1.000 (0.999&#x02013;1.000)</td>
<td valign="top" align="center">1.000</td>
<td valign="top" align="center">1.000</td>
<td valign="top" align="center">1.000</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">GSE22255</td>
<td valign="top" align="center">0.688 (0.510&#x02013;0.845)</td>
<td valign="top" align="center">0.850</td>
<td valign="top" align="center">0.500</td>
<td valign="top" align="center">0.675</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">GSE195442</td>
<td valign="top" align="center">0.740 (0.490&#x02013;0.950)</td>
<td valign="top" align="center">0.800</td>
<td valign="top" align="center">0.500</td>
<td valign="top" align="center">0.650</td>
</tr>
<tr>
<td valign="top" align="left">SVM-RFE&#x0002B;ANN</td>
<td valign="top" align="left">Train</td>
<td valign="top" align="center">0.995 (0.988&#x02013;0.999)</td>
<td valign="top" align="center">0.946</td>
<td valign="top" align="center">0.982</td>
<td valign="top" align="center">0.964</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">GSE22255</td>
<td valign="top" align="center">0.605 (0.420&#x02013;0.771)</td>
<td valign="top" align="center">0.700</td>
<td valign="top" align="center">0.400</td>
<td valign="top" align="center">0.550</td>
</tr>
<tr>
<td valign="top" align="left">RF&#x0002B;ANN</td>
<td valign="top" align="left">Train</td>
<td valign="top" align="center">0.997 (0.991&#x02013;1.000)</td>
<td valign="top" align="center">0.964</td>
<td valign="top" align="center">1.000</td>
<td valign="top" align="center">0.982</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">GSE22255</td>
<td valign="top" align="center">0.619 (0.429&#x02013;0.787)</td>
<td valign="top" align="center">0.750</td>
<td valign="top" align="center">0.450</td>
<td valign="top" align="center">0.600</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">GSE195442</td>
<td valign="top" align="center">0.630 (0.360&#x02013;0.860)</td>
<td valign="top" align="center">0.700</td>
<td valign="top" align="center">0.400</td>
<td valign="top" align="center">0.550</td>
</tr>
</tbody>
</table>
</table-wrap>
<p>To further evaluate the prediction performance of the combination of machine learning algorithms, we constructed and validated the LASSO&#x0002B;SVM-RFE and SVM-RFE&#x0002B;RF models. The AUC, sensitivity, and specificity of the LASSO&#x0002B;SVM-RFE and SVM-RFE&#x0002B;RF models were also satisfactory, as shown in <xref ref-type="table" rid="T3">Table 3</xref>.</p>
</sec>
<sec>
<title>Construction and validation of the ANN model</title>
<p>To evaluate the prediction performance of the ANN model, we constructed and validated ANN models for feature genes screened by LASSO, SVM-RFE, and RF, respectively. The visualization of the LASSO&#x0002B;ANN, SVM-RFE&#x0002B;ANN, and RF&#x0002B;ANN models is shown in <xref ref-type="fig" rid="F6">Figures 6A,E,H</xref>. The AUC of LASSO&#x0002B;ANN, SVM-RFE&#x0002B;ANN, and RF&#x0002B;ANN models in the training dataset was 1.000, 0.995, and 0.997, respectively (<xref ref-type="fig" rid="F6">Figures 6B,F,I</xref>). The AUC of LASSO&#x0002B;ANN, SVM-RFE&#x0002B;ANN, and RF&#x0002B;ANN in the GSE22255 dataset was 0.688, 0.605, and 0.619, respectively (<xref ref-type="fig" rid="F6">Figures 6C,G,J</xref>). The AUC of LASSO&#x0002B;ANN and RF&#x0002B;ANN in the GSE195442 dataset was 0.740 and 0.630, respectively (<xref ref-type="fig" rid="F6">Figures 6D,K</xref>).</p>
<fig id="F6" position="float">
<label>Figure 6</label>
<caption><p>Development and validation of ANN models. <bold>(A,E,H)</bold> Visualization of the LASSO&#x0002B;ANN, SVM-RFE&#x0002B;ANN, and RF&#x0002B;ANN models. <bold>(B&#x02013;D)</bold> ROC analysis for model prediction of the LASSO&#x0002B;ANN in the training, GSE22255, and GSE195442 datasets. <bold>(F,G)</bold> ROC analysis for model prediction of the SVM-RFE&#x0002B;ANN in the training, GSE22255, and GSE195442 datasets. <bold>(I&#x02013;K)</bold> ROC analysis for model prediction of the RF&#x0002B;ANN model in the training, GSE22255, and GSE195442 datasets.</p></caption>
<graphic mimetype="image" mime-subtype="tiff" xlink:href="fneur-13-1014346-g0006.tif"/>
</fig>
<p>In the training dataset, the sensitivity, specificity, and accuracy of the LASSO&#x0002B;ANN model were 1.000, 1.000, and 1.000, respectively; of the SVM-RFE&#x0002B;ANN model were 0.946, 0.982, and 0.964, respectively; and of the RF&#x0002B;ANN model were 0.964, 1.000, and 0.982, respectively. In the test datasets, the sensitivity (true positive rate) was very satisfactory; however, the specificity (true negative rate) and accuracy were not good. This shows that the ANN model is very efficient at classifying positive samples and is unsuitable at classifying negative samples (<xref ref-type="table" rid="T3">Table 3</xref>).</p>
</sec>
</sec>
<sec sec-type="discussion" id="s4">
<title>Discussion</title>
<p>In this study, the 69 DEGs identified were mainly involved in the immune response and inflammatory response. Inflammation is one of the initial responses of the immune system to a stimulus. Studies have shown that the immune system plays a very important role in the acute and chronic stages of ischemic damage and in the long-term sequelae of stroke (<xref ref-type="bibr" rid="B22">22</xref>). The pathways enriched in the IS group were complement and coagulation cascades, lysosome, PPAR signaling pathway, regulation of autophagy, and TLR signaling pathway. A sudden interruption of IS blood flow can lead to vascular endothelial changes, local retention of blood cells, platelet-leukocyte adhesion, and activation of the coagulation cascade, whereas thrombin induces the expression of adhesion molecules on endothelial cells, disrupts endothelial barrier function, and activates complement C3 and C5 (<xref ref-type="bibr" rid="B23">23</xref>). TLR, as part of the innate immune system, plays an important role in the immune response of IS (<xref ref-type="bibr" rid="B24">24</xref>). After the occurrence of hypoxic-ischemic events, part of the TLRs present in the endothelial cell membranes is involved in endothelial dysfunction and plays an indispensable role in the activation of inflammatory cascades (<xref ref-type="bibr" rid="B25">25</xref>). The autophagy-lysosomal pathway participates in the clearance of aberrant cellular components to maintain protein homeostasis and normal cellular function. Evidence indicated that the impairment of this pathway during cerebral ischemia led to ischemia-induced neuronal necrosis and apoptosis (<xref ref-type="bibr" rid="B26">26</xref>).</p>
<p>Stroke is the second leading cause of disability and death worldwide. Currently, there are no effective treatments to improve stroke survival and quality of life. Early diagnosis and intervention of IS play an essential role in reducing deaths.</p>
<p>A great deal of effort has been put into post-IS management, and there are many methods that play a role in assessing unfavorable post-IS outcomes, such as real-time biosignaling (<xref ref-type="bibr" rid="B27">27</xref>), quantitative electroencephalography (qEEG) (<xref ref-type="bibr" rid="B28">28</xref>), and electromyography (<xref ref-type="bibr" rid="B29">29</xref>). Noninvasive qEEG has good discriminative power in the quantitative evaluation of neurological outcomes after stroke compared with known demographic, clinical, and radiographic prognostic markers. Electromyography (EMG) is also considered a potential predictive tool for post-stroke gait and rehabilitation management because it is sensitive to neuromuscular changes induced by IS. Myoelectric biomarkers will help detect gait changes in stroke-impaired patients and determine post-stroke rehabilitation. There are also many methods that can assist in the diagnosis of IS. The imaging biomarker of carotid plaque can also be used to predict stroke risk (<xref ref-type="bibr" rid="B30">30</xref>). To date, most studies examining stroke have used MRI or CT images, which can be difficult to diagnose in advance. Studies have found that electrocardiography (<xref ref-type="bibr" rid="B31">31</xref>) and echocardiography (<xref ref-type="bibr" rid="B32">32</xref>) can also predict IS risk. Although electrocardiography and echocardiography are noninvasive and low-cost diagnostic methods, their low sensitivity can easily lead to misdiagnosis. Therefore, it is necessary to develop a highly sensitive and accurate method for the early diagnosis of IS.</p>
<p>This study aimed to construct prediction models of IS based on transcriptomics using machine learning methods. Overall, among the eight models, the LASSO, SVM-RFE, and RF performed best with the highest values in performance (AUC, sensitivity, specificity) in the training dataset and test datasets, followed by LASSO&#x0002B;SVM-RFE and SVM-RFE&#x0002B;RF, the LASSO&#x0002B;ANN, SVM-RFE&#x0002B;ANN, and RF&#x0002B;ANN models performed worst. It demonstrated that the LASSO, SVM-RFE, and RF models could be used independently to predict the risk of IS.</p>
<p>At present, many IS risk prediction models have been established. In 2021, a case-control study in China developed a LASSO model to better identify IS. The prediction model showed good discrimination, with an AUC of 0.916 for the LASSO method using 14 features (<xref ref-type="bibr" rid="B33">33</xref>). In this study, the LASSO, SVM-RFE, and RF models performed well, and the AUC value reached more than 90%. The sensitivity, specificity, and accuracy of LASSO, SVM-RFE, and RF models were still very satisfactory in the test datasets. This indicated that the LASSO, SVM-RFE, and RF diagnostic models have diagnostic robustness and potential utility in detecting IS.</p>
<p>A radiomics study identified the selection of the LASSO combined with the SVM as the optimal method for differentiating gliosarcoma and glioblastoma (<xref ref-type="bibr" rid="B34">34</xref>). This result suggested that models constructed by combining several machine learning algorithms may result in better prediction ability than a single algorithm. Therefore, we constructed and validated the LASSO&#x0002B;SVM-RFE and SVM-RFE&#x0002B;RF models of IS. Although the AUC, sensitivity, and specificity of LASSO&#x0002B;SVM-RFE and LASSO&#x0002B;RF models were still very satisfactory, they were still slightly inferior to LASSO, SVM-RFE, and RF models. This result was the opposite of what was expected.</p>
<p>The neural network of deep learning enables the models to scale exponentially with the growing quantity and dimensionality of data, which makes deep learning particularly useful for solving complex problems (<xref ref-type="bibr" rid="B35">35</xref>). The growing popularity of deep learning in healthcare has accelerated research into its utility in the complex biology of cancer (<xref ref-type="bibr" rid="B36">36</xref>). A study found that ANN is the most suitable diagnostic model based on machine learning in skin cutaneous melanoma (<xref ref-type="bibr" rid="B37">37</xref>). In this study, to evaluate the prediction performance of the ANN model, we constructed and validated ANN models for feature genes screened by LASSO, SVM-RFE, and RF, respectively. The sensitivity value, that is, the true positive rate, reached more than 70% in the test dataset. However, the specificity value reached &#x0003C;50% in the test dataset. This showed that the ANN model is efficient at classifying positive samples and is unsuitable at classifying negative samples. This study obtained the predictive ability of each model by constructing and comparing the multiple models of IS, which provided a new method for the early diagnosis and prediction of IS.</p>
<p>This study also had some limitations. First, due to the lack of clinical data on IS in the GEO database, the clinical features of IS were not included in the diagnostic models. In addition, the insufficient sample size of IS in the GEO database may affect the diagnostic effect of the IS model.</p>
</sec>
<sec sec-type="conclusions" id="s5">
<title>Conclusion</title>
<p>In this study, we constructed and validated the LASSO, SVM-RFE, RF, and ANN disease classification models. The AUC, sensitivity, and specificity indicated that the LASSO, SVM-RFE, and RF models performed well for IS diagnosis and prediction. However, the ANN model is efficient at classifying positive samples and is unsuitable at classifying negative samples. Nevertheless, large-scale and multiple-center studies will be needed to verify our findings.</p>
</sec>
<sec sec-type="data-availability" id="s6">
<title>Data availability statement</title>
<p>The original contributions presented in the study are included in the article/supplementary material, further inquiries can be directed to the corresponding author/s.</p>
</sec>
<sec id="s7">
<title>Author contributions</title>
<p>W-XY and C-GY conceived and designed the study. W-XY, F-FW, and Y-YP made the diagrams and tables of the article. W-XY wrote the manuscript. J-QX and M-HL revised the manuscript. All authors read and approved the final manuscript.</p>
</sec>
<sec sec-type="funding-information" id="s8">
<title>Funding</title>
<p>This study was supported by the Gansu Province Youth Science and Technology Foundation (21JR7RA421) and the Cuiying Scientific and Technological Innovation Program of Lanzhou University Second Hospital (CY2021-BJ-A16 and CY2020-MS18).</p>
</sec>
<sec sec-type="COI-statement" id="conf1">
<title>Conflict of interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec sec-type="disclaimer" id="s9">
<title>Publisher&#x00027;s note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
</body>
<back>
<ref-list>
<title>References</title>
<ref id="B1">
<label>1.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Feigin</surname> <given-names>VL</given-names></name> <name><surname>Stark</surname> <given-names>BA</given-names></name> <name><surname>Johnson</surname> <given-names>CO</given-names></name> <name><surname>Roth</surname> <given-names>GA</given-names></name> <name><surname>Bisignano</surname> <given-names>C</given-names></name> <name><surname>Abady</surname> <given-names>GG</given-names></name> <etal/></person-group>. <article-title>Global, regional, and national burden of stroke and its risk factors, 1990&#x02013;2019: a systematic analysis for the global burden of disease study 2019</article-title>. <source>Lancet Neurol.</source> (<year>2021</year>) <volume>20</volume>:<fpage>795</fpage>&#x02013;<lpage>820</lpage>. <pub-id pub-id-type="doi">10.1016/s1474-4422(21)00252-0</pub-id><pub-id pub-id-type="pmid">34487721</pub-id></citation></ref>
<ref id="B2">
<label>2.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Johnson</surname> <given-names>CO</given-names></name> <name><surname>Nguyen</surname> <given-names>M</given-names></name> <name><surname>Roth</surname> <given-names>GA</given-names></name> <name><surname>Nichols</surname> <given-names>E</given-names></name> <name><surname>Alam</surname> <given-names>T</given-names></name> <name><surname>Abate</surname> <given-names>D</given-names></name> <etal/></person-group>. <article-title>global, regional, and national burden of stroke, 1990&#x02013;2016: a systematic analysis for the global burden of disease study 2016</article-title>. <source>Lancet Neurol.</source> (<year>2019</year>) <volume>18</volume>:<fpage>439</fpage>&#x02013;<lpage>58</lpage>. <pub-id pub-id-type="doi">10.1016/s1474-4422(19)30034-1</pub-id><pub-id pub-id-type="pmid">30871944</pub-id></citation></ref>
<ref id="B3">
<label>3.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wu</surname> <given-names>S</given-names></name> <name><surname>Wu</surname> <given-names>BO</given-names></name> <name><surname>Liu</surname> <given-names>M</given-names></name> <name><surname>Chen</surname> <given-names>Z</given-names></name> <name><surname>Wang</surname> <given-names>W</given-names></name> <name><surname>Anderson</surname> <given-names>CS</given-names></name> <etal/></person-group>. <article-title>Stroke in China: advances and challenges in epidemiology, prevention, and management</article-title>. <source>Lancet Neurol.</source> (<year>2019</year>) <volume>18</volume>:<fpage>394</fpage>&#x02013;<lpage>405</lpage>. <pub-id pub-id-type="doi">10.1016/S1474-4422(18)30500-3</pub-id><pub-id pub-id-type="pmid">30878104</pub-id></citation></ref>
<ref id="B4">
<label>4.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ma</surname> <given-names>Q</given-names></name> <name><surname>Li</surname> <given-names>R</given-names></name> <name><surname>Wang</surname> <given-names>L</given-names></name> <name><surname>Yin</surname> <given-names>P</given-names></name> <name><surname>Wang</surname> <given-names>Y</given-names></name> <name><surname>Yan</surname> <given-names>C</given-names></name> <etal/></person-group>. <article-title>Temporal trend and attributable risk factors of stroke burden in China, 1990&#x02013;2019: an analysis for the Global Burden of Disease Study 2019</article-title>. <source>Lancet Public Health.</source> (<year>2021</year>) <volume>6</volume>:<fpage>e897</fpage>&#x02013;<lpage>906</lpage>. <pub-id pub-id-type="doi">10.1016/S2468-2667(21)00228-0</pub-id><pub-id pub-id-type="pmid">34838196</pub-id></citation></ref>
<ref id="B5">
<label>5.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kivioja</surname> <given-names>R</given-names></name> <name><surname>Pietil&#x000E4;</surname> <given-names>A</given-names></name> <name><surname>Martinez-Majander</surname> <given-names>N</given-names></name> <name><surname>Gordin</surname> <given-names>D</given-names></name> <name><surname>Havulinna</surname> <given-names>AS</given-names></name> <name><surname>Salomaa</surname> <given-names>V</given-names></name> <etal/></person-group>. <article-title>Risk factors for early-onset ischemic stroke: a case-control study</article-title>. <source>J Am Heart Assoc.</source> (<year>2018</year>) <volume>7</volume>:<fpage>e009774</fpage>. <pub-id pub-id-type="doi">10.1161/JAHA.118.009774</pub-id><pub-id pub-id-type="pmid">30608196</pub-id></citation></ref>
<ref id="B6">
<label>6.</label>
<citation citation-type="journal"><person-group person-group-type="author"><collab>Ekkert A &#x00160;liachtenko A Grigaite J Burnyte B Utkus A Jatu&#x0017D;is D </collab></person-group>. <article-title>Ischemic stroke genetics: What is new and how to apply it in clinical practice?</article-title> <source>Genes</source>. (<year>2021</year>) <volume>13</volume>:<fpage>48</fpage>. <pub-id pub-id-type="doi">10.3390/genes13010048</pub-id><pub-id pub-id-type="pmid">35052389</pub-id></citation></ref>
<ref id="B7">
<label>7.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>S&#x000F6;derholm</surname> <given-names>M</given-names></name> <name><surname>Pedersen</surname> <given-names>A</given-names></name> <name><surname>Lorentzen</surname> <given-names>E</given-names></name> <name><surname>Stanne</surname> <given-names>TM</given-names></name> <name><surname>Bevan</surname> <given-names>S</given-names></name> <name><surname>Olsson</surname> <given-names>M</given-names></name> <etal/></person-group>. <article-title>Genome-wide association meta-analysis of functional outcome after ischemic stroke</article-title>. <source>Neurology.</source> (<year>2019</year>) <volume>92</volume>:<fpage>e1271</fpage>&#x02013;<lpage>83</lpage>. <pub-id pub-id-type="doi">10.1212/WNL.0000000000007138</pub-id><pub-id pub-id-type="pmid">30796134</pub-id></citation></ref>
<ref id="B8">
<label>8.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hopyan</surname> <given-names>J</given-names></name> <name><surname>Ciarallo</surname> <given-names>A</given-names></name> <name><surname>Dowlatshahi</surname> <given-names>D</given-names></name> <name><surname>Howard</surname> <given-names>P</given-names></name> <name><surname>John</surname> <given-names>V</given-names></name> <name><surname>Yeung</surname> <given-names>R</given-names></name> <etal/></person-group>. <article-title>Certainty of stroke diagnosis: incremental benefit with CT perfusion over noncontrast CT and CT angiography</article-title>. <source>Radiology.</source> (<year>2010</year>) <volume>255</volume>:<fpage>142</fpage>&#x02013;<lpage>53</lpage>. <pub-id pub-id-type="doi">10.1148/radiol.09091021</pub-id><pub-id pub-id-type="pmid">20308452</pub-id></citation></ref>
<ref id="B9">
<label>9.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Campbell</surname> <given-names>BCV</given-names></name> <name><surname>Khatri</surname> <given-names>P</given-names></name></person-group>. <article-title>Stroke</article-title>. <source>Lancet.</source> (<year>2020</year>) <volume>396</volume>:<fpage>129</fpage>&#x02013;<lpage>42</lpage>. <pub-id pub-id-type="doi">10.1016/S0140-6736(20)31179-X</pub-id><pub-id pub-id-type="pmid">32653056</pub-id></citation></ref>
<ref id="B10">
<label>10.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Montellano</surname> <given-names>FA</given-names></name> <name><surname>Ungeth&#x000FC;m</surname> <given-names>K</given-names></name> <name><surname>Ramiro</surname> <given-names>L</given-names></name> <name><surname>Nacu</surname> <given-names>A</given-names></name> <name><surname>Hellwig</surname> <given-names>S</given-names></name> <name><surname>Fluri</surname> <given-names>F</given-names></name> <etal/></person-group>. <article-title>Role of blood-based biomarkers in ischemic stroke prognosis: a systematic review</article-title>. <source>Stroke.</source> (<year>2021</year>) <volume>52</volume>:<fpage>543</fpage>&#x02013;<lpage>51</lpage>. <pub-id pub-id-type="doi">10.1161/STROKEAHA.120.029232</pub-id><pub-id pub-id-type="pmid">33617344</pub-id></citation></ref>
<ref id="B11">
<label>11.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fahey</surname> <given-names>M</given-names></name> <name><surname>Crayton</surname> <given-names>E</given-names></name> <name><surname>Wolfe</surname> <given-names>C</given-names></name> <name><surname>Douiri</surname> <given-names>A</given-names></name></person-group>. <article-title>Clinical prediction models for mortality and functional outcome following ischemic stroke: a systematic review and meta-analysis</article-title>. <source>PLoS One.</source> (<year>2018</year>) <volume>13</volume>:<fpage>e0185402</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0185402</pub-id><pub-id pub-id-type="pmid">29377923</pub-id></citation></ref>
<ref id="B12">
<label>12.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Van Calster</surname> <given-names>B</given-names></name> <name><surname>Wynants</surname> <given-names>L</given-names></name></person-group>. <article-title>Machine learning in medicine</article-title>. <source>N Engl J Med.</source> (<year>2019</year>) <volume>380</volume>:<fpage>2588</fpage>. <pub-id pub-id-type="doi">10.1056/NEJMc1906060</pub-id><pub-id pub-id-type="pmid">31242381</pub-id></citation></ref>
<ref id="B13">
<label>13.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Heo</surname> <given-names>J</given-names></name> <name><surname>Yoon</surname> <given-names>JG</given-names></name> <name><surname>Park</surname> <given-names>H</given-names></name> <name><surname>Kim</surname> <given-names>YD</given-names></name> <name><surname>Nam</surname> <given-names>HS</given-names></name> <name><surname>Heo</surname> <given-names>JH</given-names></name></person-group>. <article-title>Machine learning-based model for prediction of outcomes in acute stroke</article-title>. <source>Stroke.</source> (<year>2019</year>) <volume>50</volume>:<fpage>1263</fpage>&#x02013;<lpage>5</lpage>. <pub-id pub-id-type="doi">10.1161/STROKEAHA.118.024293</pub-id><pub-id pub-id-type="pmid">30890116</pub-id></citation></ref>
<ref id="B14">
<label>14.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lee</surname> <given-names>H</given-names></name> <name><surname>Lee</surname> <given-names>EJ</given-names></name> <name><surname>Ham</surname> <given-names>S</given-names></name> <name><surname>Lee</surname> <given-names>HB</given-names></name> <name><surname>Lee</surname> <given-names>JS</given-names></name> <name><surname>Kwon</surname> <given-names>SU</given-names></name> <etal/></person-group>. <article-title>Machine learning approach to identify stroke within 45 hours</article-title>. <source>Stroke.</source> (<year>2020</year>) <volume>51</volume>:<fpage>860</fpage>&#x02013;<lpage>6</lpage>. <pub-id pub-id-type="doi">10.1161/STROKEAHA.119.027611</pub-id><pub-id pub-id-type="pmid">31987014</pub-id></citation></ref>
<ref id="B15">
<label>15.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chun</surname> <given-names>M</given-names></name> <name><surname>Clarke</surname> <given-names>R</given-names></name> <name><surname>Cairns</surname> <given-names>BJ</given-names></name> <name><surname>Clifton</surname> <given-names>D</given-names></name> <name><surname>Bennett</surname> <given-names>D</given-names></name> <name><surname>Chen</surname> <given-names>Y</given-names></name> <etal/></person-group>. <article-title>Stroke risk prediction using machine learning: a prospective cohort study of 05 million Chinese adults</article-title>. <source>J Am Med Inform Assoc.</source> (<year>2021</year>) <volume>28</volume>:<fpage>1719</fpage>&#x02013;<lpage>27</lpage>. <pub-id pub-id-type="doi">10.1093/jamia/ocab068</pub-id><pub-id pub-id-type="pmid">33969418</pub-id></citation></ref>
<ref id="B16">
<label>16.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wang</surname> <given-names>S</given-names></name> <name><surname>Dai</surname> <given-names>Y</given-names></name> <name><surname>Shen</surname> <given-names>J</given-names></name> <name><surname>Xuan</surname> <given-names>J</given-names></name></person-group>. <article-title>Research on expansion and classification of imbalanced data based on SMOTE algorithm</article-title>. <source>Sci Rep.</source> (<year>2021</year>) <volume>11</volume>:<fpage>24039</fpage>. <pub-id pub-id-type="doi">10.1038/s41598-021-03430-5</pub-id><pub-id pub-id-type="pmid">34912009</pub-id></citation></ref>
<ref id="B17">
<label>17.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Barr</surname> <given-names>TL</given-names></name> <name><surname>Conley</surname> <given-names>Y</given-names></name> <name><surname>Ding</surname> <given-names>J</given-names></name> <name><surname>Dillman</surname> <given-names>A</given-names></name> <name><surname>Warach</surname> <given-names>S</given-names></name> <name><surname>Singleton</surname> <given-names>A</given-names></name> <etal/></person-group>. <article-title>Genomic biomarkers and cellular pathways of ischemic stroke by RNA gene expression profiling</article-title>. <source>Neurology.</source> (<year>2010</year>) <volume>75</volume>:<fpage>1009</fpage>&#x02013;<lpage>14</lpage>. <pub-id pub-id-type="doi">10.1212/WNL.0b013e3181f2b37f</pub-id><pub-id pub-id-type="pmid">20837969</pub-id></citation></ref>
<ref id="B18">
<label>18.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stamova</surname> <given-names>B</given-names></name> <name><surname>Jickling</surname> <given-names>GC</given-names></name> <name><surname>Ander</surname> <given-names>BP</given-names></name> <name><surname>Zhan</surname> <given-names>X</given-names></name> <name><surname>Liu</surname> <given-names>D</given-names></name> <name><surname>Turner</surname> <given-names>R</given-names></name> <etal/></person-group>. <article-title>Gene expression in peripheral immune cells following cardioembolic stroke is sexually dimorphic</article-title>. <source>PLoS ONE.</source> (<year>2014</year>) <volume>9</volume>:<fpage>e102550</fpage>. <pub-id pub-id-type="doi">10.1371/journal.pone.0102550</pub-id><pub-id pub-id-type="pmid">25036109</pub-id></citation></ref>
<ref id="B19">
<label>19.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Barr</surname> <given-names>TL</given-names></name> <name><surname>VanGilder</surname> <given-names>R</given-names></name> <name><surname>Rellick</surname> <given-names>S</given-names></name> <name><surname>Brooks</surname> <given-names>SD</given-names></name> <name><surname>Doll</surname> <given-names>DN</given-names></name> <name><surname>Lucke-Wold</surname> <given-names>AN</given-names></name> <etal/></person-group>. <article-title>A genomic profile of the immune response to stroke with implications for stroke recovery</article-title>. <source>Biol Res Nurs.</source> (<year>2015</year>) <volume>17</volume>:<fpage>248</fpage>&#x02013;<lpage>56</lpage>. <pub-id pub-id-type="doi">10.1177/1099800414546492</pub-id><pub-id pub-id-type="pmid">25124890</pub-id></citation></ref>
<ref id="B20">
<label>20.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Krug</surname> <given-names>T</given-names></name> <name><surname>Gabriel</surname> <given-names>JP</given-names></name> <name><surname>Taipa</surname> <given-names>R</given-names></name> <name><surname>Fonseca</surname> <given-names>BV</given-names></name> <name><surname>Domingues-Montanari</surname> <given-names>S</given-names></name> <name><surname>Fernandez-Cadenas</surname> <given-names>I</given-names></name> <etal/></person-group>. <article-title>TTC7B emerges as a novel risk factor for ischemic stroke through the convergence of several genome-wide approaches</article-title>. <source>J Cereb Blood Flow Metab.</source> (<year>2012</year>) <volume>32</volume>:<fpage>1061</fpage>&#x02013;<lpage>72</lpage>. <pub-id pub-id-type="doi">10.1038/jcbfm.2012.24</pub-id><pub-id pub-id-type="pmid">22453632</pub-id></citation></ref>
<ref id="B21">
<label>21.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yang</surname> <given-names>J</given-names></name> <name><surname>Hao</surname> <given-names>J</given-names></name> <name><surname>Lin</surname> <given-names>Y</given-names></name> <name><surname>Guo</surname> <given-names>Y</given-names></name> <name><surname>Liao</surname> <given-names>K</given-names></name> <name><surname>Yang</surname> <given-names>M</given-names></name> <etal/></person-group>. <article-title>Profile and Functional Prediction of Plasma Exosome-Derived CircRNAs From Acute Ischemic Stroke Patients</article-title>. <source>Front Genet.</source> (<year>2022</year>) <volume>13</volume>:<fpage>810974</fpage>. <pub-id pub-id-type="doi">10.3389/fgene.2022.810974</pub-id><pub-id pub-id-type="pmid">35360855</pub-id></citation></ref>
<ref id="B22">
<label>22.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Iadecola</surname> <given-names>C</given-names></name> <name><surname>Buckwalter</surname> <given-names>MS</given-names></name> <name><surname>Anrather</surname> <given-names>J</given-names></name></person-group>. <article-title>Immune responses to stroke: mechanisms, modulation, and therapeutic potential</article-title>. <source>J Clin Invest.</source> (<year>2020</year>) <volume>130</volume>:<fpage>2777</fpage>&#x02013;<lpage>88</lpage>. <pub-id pub-id-type="doi">10.1172/JCI135530</pub-id><pub-id pub-id-type="pmid">32391806</pub-id></citation></ref>
<ref id="B23">
<label>23.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Amara</surname> <given-names>U</given-names></name> <name><surname>Rittirsch</surname> <given-names>D</given-names></name> <name><surname>Flierl</surname> <given-names>M</given-names></name> <name><surname>Bruckner</surname> <given-names>U</given-names></name> <name><surname>Klos</surname> <given-names>A</given-names></name> <name><surname>Gebhard</surname> <given-names>F</given-names></name> <etal/></person-group>. <article-title>Interaction between the coagulation and complement system</article-title>. <source>Adv Exp Med Biol.</source> (<year>2008</year>) <volume>632</volume>:<fpage>71</fpage>&#x02013;<lpage>9</lpage>. <pub-id pub-id-type="doi">10.1007/978-0-387-78952-1_6</pub-id><pub-id pub-id-type="pmid">19025115</pub-id></citation></ref>
<ref id="B24">
<label>24.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fadakar</surname> <given-names>K</given-names></name> <name><surname>Dadkhahfar</surname> <given-names>S</given-names></name> <name><surname>Esmaeili</surname> <given-names>A</given-names></name> <name><surname>Rezaei</surname> <given-names>N</given-names></name></person-group>. <article-title>The role of toll-like receptors (TLRs) in stroke</article-title>. <source>Rev Neurosci.</source> (<year>2014</year>) <volume>25</volume>:<fpage>699</fpage>&#x02013;<lpage>712</lpage>. <pub-id pub-id-type="doi">10.1515/revneuro-2013-0069</pub-id><pub-id pub-id-type="pmid">24807166</pub-id></citation></ref>
<ref id="B25">
<label>25.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ashayeri Ahmadabad</surname> <given-names>R</given-names></name> <name><surname>Mirzaasgari</surname> <given-names>Z</given-names></name> <name><surname>Gorji</surname> <given-names>A</given-names></name> <name><surname>Khaleghi Ghadiri</surname> <given-names>M</given-names></name></person-group>. <article-title>Toll-like receptor signaling pathways: novel therapeutic targets for cerebrovascular disorders</article-title>. <source>Int J Mol Sci</source>. (<year>2021</year>) <volume>22</volume>:<fpage>6153</fpage>. <pub-id pub-id-type="doi">10.3390/ijms22116153</pub-id><pub-id pub-id-type="pmid">34200356</pub-id></citation></ref>
<ref id="B26">
<label>26.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chen</surname> <given-names>C</given-names></name> <name><surname>Qin</surname> <given-names>H</given-names></name> <name><surname>Tan</surname> <given-names>J</given-names></name> <name><surname>Hu</surname> <given-names>Z</given-names></name> <name><surname>Zeng</surname> <given-names>L</given-names></name></person-group>. <article-title>The role of ubiquitin-proteasome pathway and autophagy-lysosome pathway in Cerebral Ischemia</article-title>. <source>Oxid Med Cell Longev.</source> (<year>2020</year>) <volume>2020</volume>:<fpage>5457049</fpage>. <pub-id pub-id-type="doi">10.1155/2020/5457049</pub-id><pub-id pub-id-type="pmid">32089771</pub-id></citation></ref>
<ref id="B27">
<label>27.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Choi</surname> <given-names>YA</given-names></name> <name><surname>Park</surname> <given-names>SJ</given-names></name> <name><surname>Jun</surname> <given-names>JA</given-names></name> <name><surname>Pyo</surname> <given-names>CS</given-names></name> <name><surname>Cho</surname> <given-names>KH</given-names></name> <name><surname>Lee</surname> <given-names>HS</given-names></name> <etal/></person-group>. <article-title>Deep learning-based stroke disease prediction system using real-time bio signals</article-title>. <source>Sensors</source>. (<year>2021</year>) <volume>21</volume>:<fpage>4269</fpage>. <pub-id pub-id-type="doi">10.3390/s21134269</pub-id><pub-id pub-id-type="pmid">34206540</pub-id></citation></ref>
<ref id="B28">
<label>28.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bentes</surname> <given-names>C</given-names></name> <name><surname>Peralta</surname> <given-names>AR</given-names></name> <name><surname>Viana</surname> <given-names>P</given-names></name> <name><surname>Martins</surname> <given-names>H</given-names></name> <name><surname>Morgado</surname> <given-names>C</given-names></name> <name><surname>Casimiro</surname> <given-names>C</given-names></name> <etal/></person-group>. <article-title>Quantitative EEG and functional outcome following acute ischemic stroke</article-title>. <source>Clin Neurophysiol.</source> (<year>2018</year>) <volume>129</volume>:<fpage>1680</fpage>&#x02013;<lpage>7</lpage>. <pub-id pub-id-type="doi">10.1016/j.clinph.2018.05.021</pub-id><pub-id pub-id-type="pmid">29935475</pub-id></citation></ref>
<ref id="B29">
<label>29.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hussain</surname> <given-names>I</given-names></name> <name><surname>Park</surname> <given-names>SJ</given-names></name></person-group>. <article-title>Prediction of myoelectric biomarkers in post-stroke gait</article-title>. <source>Sensors</source>. (<year>2021</year>) <volume>21</volume>:<fpage>5334</fpage>. <pub-id pub-id-type="doi">10.3390/s21165334</pub-id><pub-id pub-id-type="pmid">34450776</pub-id></citation></ref>
<ref id="B30">
<label>30.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Saba</surname> <given-names>L</given-names></name> <name><surname>Saam</surname> <given-names>T</given-names></name> <name><surname>J&#x000E4;ger</surname> <given-names>HR</given-names></name> <name><surname>Yuan</surname> <given-names>C</given-names></name> <name><surname>Hatsukami</surname> <given-names>TS</given-names></name> <name><surname>Saloner</surname> <given-names>D</given-names></name> <etal/></person-group>. <article-title>Imaging biomarkers of vulnerable carotid plaques for stroke risk prediction and their potential clinical implications</article-title>. <source>Lancet Neurol.</source> (<year>2019</year>) <volume>18</volume>:<fpage>559</fpage>&#x02013;<lpage>72</lpage>. <pub-id pub-id-type="doi">10.1016/S1474-4422(19)30035-3</pub-id><pub-id pub-id-type="pmid">30954372</pub-id></citation></ref>
<ref id="B31">
<label>31.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pirinen</surname> <given-names>J</given-names></name> <name><surname>Eranti</surname> <given-names>A</given-names></name> <name><surname>Knekt</surname> <given-names>P</given-names></name> <name><surname>Lehto</surname> <given-names>M</given-names></name> <name><surname>Martinez-Majander</surname> <given-names>N</given-names></name> <name><surname>Aro</surname> <given-names>AL</given-names></name> <etal/></person-group>. <article-title>ECG markers associated with ischemic stroke at young age - a case-control study</article-title>. <source>Ann Med.</source> (<year>2017</year>) <volume>49</volume>:<fpage>562</fpage>&#x02013;<lpage>8</lpage>. <pub-id pub-id-type="doi">10.1080/07853890.2017.1348620</pub-id><pub-id pub-id-type="pmid">28657357</pub-id></citation></ref>
<ref id="B32">
<label>32.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>O&#x00027;Neal</surname> <given-names>WT</given-names></name> <name><surname>Almahmoud</surname> <given-names>MF</given-names></name> <name><surname>Qureshi</surname> <given-names>WT</given-names></name> <name><surname>Soliman</surname> <given-names>EZ</given-names></name></person-group>. <article-title>Electrocardiographic and echocardiographic left ventricular hypertrophy in the prediction of stroke in the elderly</article-title>. <source>J Stroke Cerebrovasc Dis.</source> (<year>2015</year>) <volume>24</volume>:<fpage>1991</fpage>&#x02013;<lpage>7</lpage>. <pub-id pub-id-type="doi">10.1016/j.jstrokecerebrovasdis.2015.04.044</pub-id><pub-id pub-id-type="pmid">26153509</pub-id></citation></ref>
<ref id="B33">
<label>33.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Meng</surname> <given-names>Z</given-names></name> <name><surname>Wang</surname> <given-names>M</given-names></name> <name><surname>Guo</surname> <given-names>S</given-names></name> <name><surname>Zhou</surname> <given-names>Y</given-names></name> <name><surname>Zheng</surname> <given-names>M</given-names></name> <name><surname>Liu</surname> <given-names>M</given-names></name> <etal/></person-group>. <article-title>Development and validation of a LASSO prediction model for better identification of ischemic stroke: a case-control study in China</article-title>. <source>Front Aging Neurosci.</source> (<year>2021</year>) <volume>13</volume>:<fpage>630437</fpage>. <pub-id pub-id-type="doi">10.3389/fnagi.2021.630437</pub-id><pub-id pub-id-type="pmid">34305566</pub-id></citation></ref>
<ref id="B34">
<label>34.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Qian</surname> <given-names>Z</given-names></name> <name><surname>Zhang</surname> <given-names>L</given-names></name> <name><surname>Hu</surname> <given-names>J</given-names></name> <name><surname>Chen</surname> <given-names>S</given-names></name> <name><surname>Chen</surname> <given-names>H</given-names></name> <name><surname>Shen</surname> <given-names>H</given-names></name> <etal/></person-group>. <article-title>Machine learning-based analysis of magnetic resonance radiomics for the classification of gliosarcoma and glioblastoma</article-title>. <source>Front Oncol.</source> (<year>2021</year>) <volume>11</volume>:<fpage>699789</fpage>. <pub-id pub-id-type="doi">10.3389/fonc.2021.774369</pub-id><pub-id pub-id-type="pmid">34737968</pub-id></citation></ref>
<ref id="B35">
<label>35.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Lecun</surname> <given-names>Y</given-names></name> <name><surname>Bengio</surname> <given-names>Y</given-names></name> <name><surname>Hinton</surname> <given-names>G</given-names></name></person-group>. <article-title>Deep learning</article-title>. <source>Nature.</source> (<year>2015</year>) <volume>521</volume>:<fpage>436</fpage>&#x02013;<lpage>44</lpage>. <pub-id pub-id-type="doi">10.1038/nature14539</pub-id><pub-id pub-id-type="pmid">26017442</pub-id></citation></ref>
<ref id="B36">
<label>36.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Tran</surname> <given-names>KA</given-names></name> <name><surname>Kondrashova</surname> <given-names>O</given-names></name> <name><surname>Bradley</surname> <given-names>A</given-names></name> <name><surname>Williams</surname> <given-names>ED</given-names></name> <name><surname>Pearson</surname> <given-names>JV</given-names></name> <name><surname>Waddell</surname> <given-names>N</given-names></name></person-group>. <article-title>Deep learning in cancer diagnosis, prognosis and treatment selection</article-title>. <source>Genome Med.</source> (<year>2021</year>) <volume>13</volume>:<fpage>152</fpage>. <pub-id pub-id-type="doi">10.1186/s13073-021-00968-x</pub-id><pub-id pub-id-type="pmid">34579788</pub-id></citation></ref>
<ref id="B37">
<label>37.</label>
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ju</surname> <given-names>A</given-names></name> <name><surname>Tang</surname> <given-names>J</given-names></name> <name><surname>Chen</surname> <given-names>S</given-names></name> <name><surname>Fu</surname> <given-names>Y</given-names></name> <name><surname>Luo</surname> <given-names>Y</given-names></name></person-group>. <article-title>Pyroptosis-related gene signatures can robustly diagnose skin cutaneous melanoma and predict the prognosis</article-title>. <source>Front Oncol.</source> (<year>2021</year>) <volume>11</volume>:<fpage>709077</fpage>. <pub-id pub-id-type="doi">10.3389/fonc.2021.709077</pub-id><pub-id pub-id-type="pmid">34327145</pub-id></citation></ref>
</ref-list>
<glossary>
<def-list>
<title>Abbreviations</title>
<def-item><term>IS</term>
<def><p>ischemic stroke</p></def></def-item>
<def-item><term>GEO</term>
<def><p>Gene Expression Omnibus</p></def></def-item>
<def-item><term>DEGs</term>
<def><p>differentially expressed genes</p></def></def-item>
<def-item><term>LASSO</term>
<def><p>least absolute shrinkage and selection operator</p></def></def-item>
<def-item><term>SVM-RFE</term>
<def><p>support vector machine-recursive feature elimination</p></def></def-item>
<def-item><term>RF</term>
<def><p>Random Forest</p></def></def-item>
<def-item><term>ANN</term>
<def><p>artificial neural network</p></def></def-item>
<def-item><term>GO</term>
<def><p>Gene Ontology</p></def></def-item>
<def-item><term>BP</term>
<def><p>biological processes</p></def></def-item>
<def-item><term>CC</term>
<def><p>cellular components</p></def></def-item>
<def-item><term>MF</term>
<def><p>molecular functions</p></def></def-item>
<def-item><term>GSEA</term>
<def><p>gene set enrichment analysis</p></def></def-item>
<def-item><term>AUC</term>
<def><p>area under the curve</p></def></def-item>
<def-item><term>ROC</term>
<def><p>receiver operating characteristic.</p></def></def-item>
</def-list>
</glossary> 
</back>
</article>