A machine learning-based predictive nomogram for early neurological improvement after thrombolysis in acute ischemic stroke

Lv, Bing-Hua; Deng, Hao-wei; Qin, Zuo-yv; Meng, Ning-qin; Weng, Gui-ming; Hu, Rui-Ting; Qin, Chao

doi:10.3389/fneur.2025.1662498

ORIGINAL RESEARCH article

Front. Neurol., 12 November 2025

Sec. Stroke

Volume 16 - 2025 | https://doi.org/10.3389/fneur.2025.1662498

This article is part of the Research TopicBrain Cytoprotection for Reperfusion Injury after Acute Ischemic StrokeView all 13 articles

A machine learning-based predictive nomogram for early neurological improvement after thrombolysis in acute ischemic stroke

Bing-Hua Lv¹^†

Hao-wei Deng¹^†

Zuo-yv Qin¹

Ning-qin Meng²

Gui-ming Weng¹

Rui-Ting Hu³^*

Chao Qin¹^*

¹Department of Neurology, The First Affiliated Hospital of Guangxi Medical University, Nanning, China
²Department of Rheumatology and Immunology, The First Affiliated Hospital of Guangxi Medical University, Nanning, China
³Department of Neurology, Minzu Hospital of Guangxi Medical University, Nanning, China

Background: Early neurological improvement (ENI) is a critical prognostic indicator for acute ischemic stroke (AIS) patients undergoing intravenous thrombolysis with recombinant tissue plasminogen activator (rt-PA). This study aimed to develop and validate a machine learning (ML)-based model for predicting ENI using clinical and biochemical data.

Methods: Clinical data from 217 AIS patients (97 ENI, 120 non-ENI) were retrospectively analyzed. Significant baseline differences were identified between groups, including hemorrhage, onset-to-needle time (ONT), neutrophil-to-lymphocyte ratio (NLR), weight, and activated partial thromboplastin time (APTT). Four ML algorithms, including Multilayer Perceptron (MLP), Random Forest (RF), Support Vector Machine (SVM), and XGBoost, were implemented. Model performance was evaluated via area under the receiver operating characteristic curve (AUC). Key predictors were identified by intersecting top-ranked features from all algorithms, followed by logistic regression modeling and nomogram visualization.

Results: The MLP model achieved the highest AUC (0.77) in the testing set, outperforming RF (0.72), SVM (0.63), and XGBoost (0.68). Six overlapping parameters, including APTT, ALT/AST ratio, ONT, mean corpuscular hemoglobin concentration (MCHC), weight, and NLR, were selected as core predictors. The logistic regression model incorporating these parameters yielded an AUC of 0.74, while the nomogram demonstrated that the predictive model exhibited strong discriminative ability (C-index: 0.817) for predicting ENI in rt-PA-treated AIS patients.

Conclusion: This ML-based model effectively predicts ENI in rt-PA-treated AIS patients by integrating critical clinical and biochemical markers. Its application may optimize personalized treatment strategies, enhance clinical decision-making, and improve patient outcomes.

Introduction

In recent years, the administration of recombinant tissue plasminogen activator (rt-PA) for intravenous thrombolysis has become a cornerstone in the acute management of ischemic stroke (1). Numerous studies have highlighted the importance of early neurological improvement (ENI) as a predictor of long-term outcomes and functional independence (2, 3). Research has shown that patients who exhibit rapid improvement in their neurological status within the first few hours post-treatment are more likely to achieve favorable outcomes (4). The mechanisms underlying ENI are complex and multifactorial, involving reperfusion of ischemic but viable brain tissue, reduction of infarct size, and preservation of the blood–brain barrier (5, 6). Moreover, several blood biomarkers are being explored to identify patients most likely to benefit from rt-PA and to predict early response to treatment.

Machine learning (ML) algorithms have emerged as powerful tools in the construction of predictive models for adverse events following acute ischemic stroke (AIS), offering significant advantages over traditional statistical approaches (7, 8). By leveraging complex, non-linear relationships within large and diverse datasets, ML algorithms can identify subtle patterns and risk factors that may not be apparent through conventional analysis, thereby enhancing the accuracy and robustness of prediction. Techniques such as logistic regression, decision trees (DT), random forests (RF), support vector machines (SVM), and neural networks have been applied to forecast complications like hemorrhagic transformation, recurrent stroke, and mortality (9–11). The use of ML also supports real-time predictions and personalized medicine, enabling timely interventions to mitigate adverse outcomes.

To date, several studies reported machine-learning based prediction of future outcome in stroke patients. For instance, Wen et al. (12) reported that the model constructed by two machine-learning served as robust tools for predicting early neurological deterioration in acute ischemic stroke patients following thrombolysis. Moreover, Fan et al. (13) used four ML methods to screen and recombine the features for construction of prognostic model, and found that this model offers improved prediction accuracy that may reduce rates of misdiagnosis and missed diagnosis in patients with AIS. Regarding ENI in AIS patients undergoing rt-PA treatment, although three studies (14–16) revealed that several clinical indexes, such as diabetes mellitus history, kynurenic acid and kynurenine aminotransferase were associated with the ENI, no studies using ML algorithms to construct predictive model for ENI in AIS patients undergoing rt-PA treatment. In addition, these previous models largely rely on a limited number of variables and traditional statistical methods (e.g., logistic regression), which may not adequately capture the complex, non-linear relationships among multiple prognostic factors. Therefore, our study addresses this gap by incorporating a comprehensive set of clinical, laboratory variables and applying multiple ML algorithms to better model these interactions, thereby enhancing predictive accuracy.

Methods

Patient selection

The study adhered to the Declaration of Helsinki and was approved by the Ethics Committee of our hospital. A retrospective analysis was conducted on 266 patients with AIS who underwent rt-PA intravenous thrombolysis (IVT) at our hospital’s Stroke Center between June 1, 2020, and November 30, 2024. Inclusion criteria were as follows: individuals aged ≥18 years with a confirmed diagnosis of AIS based on CT or MRI, who received IVT within 4.5 h of stroke onset. Patients were excluded if they received bridging endovascular treatment after IVT, had incomplete clinical or laboratory data, or were discharged or deceased within 24 h. After excluding 25 patients who underwent bridging artery thrombectomy, 19 with incomplete clinical data, and 5 who were beyond the 4.5-h thrombolysis time window, a total of 217 patients who received rt-PA IVT were included in the study. The flow chart for patient selection is shown in Figure 1.

Figure 1

Flowchart showing patient selection for IVT in AIS study. Initial group: 266 patients. Exclusions: 25 for bridging artery thrombectomy, 19 for missing data, 5 beyond time window. Final included: 217 patients, divided into ENI (97) and Non-ENI (120).

Figure 1. Flow chart of the study population.

Collection of clinical data and definition of ENI

Clinical data were collected for each patient, including demographics (age, gender, height, weight, and BMI), comorbidities, the rt-PA Dosage (0.9 or 0.6 mg/kg), baseline NIHSS score, medication history, and initial laboratory tests (blood routine examination, biochemical examination, liver function test, coagulation test, renal function test, electrolyte test, and lipid test). We also calculated inflammatory cell ratios: NLR (neutrophil-to-lymphocyte ratio), PLR (platelet-to-lymphocyte ratio), and LMR (lymphocyte-to-monocyte ratio). ENI was defined as an NIHSS score decrease of ≥4 points within 24 h of hospitalization or complete recovery within 24 h (17).

Variable selection and establishment of machine learning models

Study participants were randomly divided into training (80%) and testing (20%) sets. Clinical data were standardized, and four ML algorithms—Multilayer Perceptron (MLP), Random Forest (RF), Support Vector Machine (SVM), and XGBoost—were applied to construct predictive models for ENI and screen key parameters using the clinical data. A total of 68 clinical and laboratory variables were included in the initial ML screening phase, including both raw measurements and derived ratios (such as NLR, PLR, ALT/AST). Although some variables were biologically or mathematically related, they were retained in the initial screening to avoid premature exclusion of potentially informative features. The performance of the models was quantitatively evaluated using receiver operating characteristic (ROC) curves, with the area under the curve (AUC) serving as the primary metric. The selected clinical indicators were then integrated into a logistic regression classification algorithm. A nomogram was generated to visualize the predictive value of each parameter.

For data preprocessing, the missing data were handled using multiple imputation by chained equations (MICE), and outliers were identified using the interquartile ranges (IQRs) method and winsorized where appropriate. Continuous variables were standardized using z-score normalization prior to model training. A grid search with 5-fold cross-validation was used to optimize key parameters for each ML algorithm. The optimal parameters were selected based on the highest cross-validated AUC value. The SHapley Additive exPlanations (SHAP) Python package (version 0.40.0) was used to measure the effects of the parameters on the predictive model, assessing feature importance using a game-theoretic approach.

To mitigate overfitting due to high dimensionality, we employed a conservative feature selection strategy by intersecting the top 20 features ranked by each of the four ML algorithms, resulting in a final set of 6 variables for logistic regression modeling.

Statistical analysis

Statistical analyses were conducted using R software (version 4.2.2). Continuous variables were presented as mean ± standard deviation (SD) or IQRs, while categorical variables were expressed as percentages (n, %). Continuous variables were assessed using t-tests or non-parametric Mann–Whitney U tests, as appropriate, while chi-square tests were used for categorical variables to compare baseline characteristics between the ENI and non-ENI groups. To assess potential multicollinearity among the final set of six predictors, we computed the Variance Inflation Factor (VIF) for each variable in the logistic regression model. Significant differences were considered at p < 0.05.

Results

Comparison of clinical data between ENI and non-ENI patients

Table 1 summarizes the baseline characteristics of both groups. The study included 217 patients were divided into ENI group (n = 97) and non-ENI group (n = 120) based on the achievement of ENI. Baseline characteristics comparison revealed that ENI group patients were younger (mean age 62 vs. 67.5 years) with a higher proportion of males (74.2% vs. 66.7%). There was significantly shorter onset-to-treatment time in ENI group compared with non-ENI group (150 vs. 174 min). Lower value of NEU (5.12 vs. 5.62), NEU% (66% vs. 70%) and NLR (2.81 vs. 4.03). Higher LYM (0.23 vs. 0.18), LYM% (1.73 vs. 1.52), LMR (3.16 vs. 2.48), MCV (87.99 vs. 90.39), and Fasting GLU (5.17 vs. 5.71) were found in ENI group than in non-ENI group (p < 0.05). However, higher value of weight (67.5 vs. 62.5), BMI (24.51 vs. 23.25), APTT (30.9 vs. 29.7), A/G ratio (1.40 vs. 1.30), PA (256.89 vs. 232.67), and CHE (8623.64 vs. 8,043) were found in ENI group than in non-ENI group (p < 0.05). However, no significant differences were found in resting indexes (p > 0.05). These findings suggest that younger age, shorter thrombolysis time window, reduced inflammatory status, better nutritional/metabolic condition, and appropriate anticoagulation status may be closely associated with early neurological improvement following intravenous thrombolysis.

Table 1

Table 1. Comparison of clinical data between ENI and non-ENI patients.

Predictive value of model constructed by four ML methods

Four ML methods (MLP, RF, SVM, and XGBoost) were used to construct predictive models for ENI using the clinical data. The dataset was divided into training (80%) and testing (20%) sets, comprising 173 and 44 patients, respectively. Using default parameters, all four ML methods demonstrated moderate predictive performance. The AUC values for the training set were 0.83 (MLP), 0.94 (RF), 0.85 (SVM), and 0.99 (XGBoost), while those for the testing set were 0.77 (MLP), 0.72 (RF), 0.63 (SVM), and 0.68 (XGBoost) (Figure 2; Table 2). These results indicate that MLP achieved relatively higher predictive ability compared to the other methods.

Figure 2

Four models are evaluated using confusion matrices, bar charts, and ROC curves: A) MLP, B) RF, C) SVM, and D) XGBoost. Each model's confusion matrix shows true versus predicted labels for Non-ENI and ENI. Bar charts compare testing and training metrics like accuracy, specificity, recall, F1-score, and AUC. ROC curves visualize model performance; MLP and SVM show moderate performance, while RF and XGBoost exhibit higher effectiveness. The AUC values range from 0.68 to 0.84 across models.

Figure 2. Confusion matrices of the test set (left), comparison of metrics between the training and test sets (middle), and ROC (right) in (A) MLP; (B) RF; (C) SVM; (D) XGBoost.

Table 2

Table 2. Predictive value of four machine learning models.

Establishment of predictive model based on the parameters from ML models

To refine the predictive model, we identified the top 20 parameters from each of the four ML models and overlapped them, resulting in six common parameters: APTT, ALT/AST, ONT, MCHC, Weight, and NLR (Figure 3A). The VIF value for each parameter was APTT: 1.32; ALT/AST: 1.45, ONT: 1.28, MCHC: 1.36, Weight: 1.30, and NLR: 1.41. This conservative selection reduced the feature space from near 70–6, yielding an events-per-variable (EPV) ratio of 16.2, which supports model stability. A logistic regression model was then constructed using these six parameters, yielding an AUC value of 0.74 (Figure 3B), which indicates moderate predictive performance. A nomogram was developed to visualize the predictive value of the model relative to the six parameters (Figure 3C), demonstrating that the composite model outperformed individual parameters in predicting ENI, with a bootstrap-corrected C-index of 0.817. these results suggesting that this model show good predictive accuracy for ENI in rt-PA-treated AIS patients. The sensitivity, specificity, PPV, NPV at optimal threshold were listed in Table 3.

Figure 3

A: Venn diagram shows top 20 parameters across four machine learning models: SVM, MLP, XGBoost, and RF, with overlaps indicating shared parameters. B: ROC curve of logistic regression illustrates a performance with an area under the curve (AUC) of 0.74. C: Nomogram visualizes contributions of seven predictors (Model, APTT, ALT/AST, NLR, MCHC, ONT, Weight) to a total score, highlighting their impacts on predicted outcome probabilities.

Figure 3. Establishment of predictive model based on the parameters from ML models. (A) Venn plot of the overlapping top 20 clinical parameters from each ML model. (B) ROC curve of the logistic regression model using the common clinical indexes. (C) Nomogram of the predictive model and the six parameters.

Table 3

Table 3. Sensitivity, specificity, PPV, NPV at optimal threshold.

SHAP analysis for the model

The SHAP analysis elucidated the direction and relative importance of predictive factors influencing ENI after thrombolysis, Figure 4 listed the SHAP summary plot of the top 10 features of the RF model. Among all models evaluated, APTT emerged as the most influential positive predictor, where higher values were consistently associated with better ENI outcomes, suggesting that moderately prolonged coagulation may facilitate neurorecovery post-thrombolysis. In contrast, ALT/AST ratio and MCHC demonstrated significant negative impacts across multiple models (MLP, RF, SVM, XGBoost), implying that liver dysfunction and increased blood viscosity may hinder neurological recovery. Additionally, NEU% and age was consistently associated with poorer ENI outcomes in RF, SVM, and XGBoost models, highlighting the detrimental effects of systemic inflammation and advanced age on prognosis. Notably, RBC and body weight showed positive associations in certain models (e.g., MLP, SVM), possibly reflecting beneficial hemodynamic effects. Conversely, ONT and fasting GLU levels were linked to unfavorable outcomes, underscoring the importance of timely intervention and metabolic control. In summary, APTT, ALT/AST, MCHC, NEU%, and age were identified as the most influential predictors, with their directional effects providing critical insights for risk stratification and personalized therapeutic strategies in thrombolysis management.

Figure 4

Panel A shows a SHAP summary plot with multiple features (MCV, NEU%, NLR, etc.) displaying dots colored from blue (low value) to red (high value) against SHAP values on the x-axis. Panel B is a bar chart ranking the average absolute SHAP value by feature, with MCV having the highest impact on model output.

Figure 4. (A) SHAP summary plot of the top 10 features of the RF model. The higher the SHAP value of a feature (x-axis), the higher the probability of ENI in AIS patients undergoing rt-PA treatment. Feature values are represented in color (red for high, blue for low). (B) SHAP bar plot of the top 10 features of the RF model. The x-axis shows the mean absolute SHAP value, representing the average impact of each feature on the probability of ENI. Features are ranked by importance.

Discussion

Currently, limited evidence is available for the prediction of ENI in AIS patients undergoing rt-PA treatment. The present study conducted a comprehensive analysis by a larger sample of patients to identify significant differences in various clinical and biochemical parameters between ENI and non-ENI groups. The ENI group, consisting of 97 patients, exhibited lower levels of Hemorrhage, ONT, MCV, NEU, NEU%, NLR, and Fasting GLU, while higher levels of Weight, BMI, LYM, LYM%, LMR, A/G ratio, PA, CHE, and APTT were observed compared to the non-ENI group. These findings suggest that these indices are closely associated with ENI, whereas resting indexes did not significantly differ between the two groups, indicating their limited impact on ENI development.

To further explore the predictive capacity of ML models for ENI, we employed four ML algorithms, including MLP, RF, SSVM, and XGBoost, using the common clinical data. By dividing the patients into an 8:2 training-to-testing ratio, the MLP model demonstrated the highest predictive performance with an AUC of 0.77 in the testing set, outperforming RF (0.72), SVM (0.63), and XGBoost (0.68). Subsequently, by intersecting the critical parameters selected by all four ML methods, we identified six common parameters (APTT, ALT/AST, ONT, MCHC, Weight, and NLR) that were then used to construct a logistic regression model. This refined model achieved an AUC of 0.74, indicating its robustness in predicting ENI. Notably, the nomogram based on these six parameters showed a markedly improved predictive performance compared to individual parameters, underscoring the value of this composite approach.

In the present study, we chose the intersection of top-ranked features across multiple ML algorithms as our primary feature selection strategy for several methodological and clinical reasons. This is because different ML algorithms have distinct biases in feature importance estimation. This consensus approach enhances reproducibility. In addition, methods like LASSO are sensitive to multicollinearity and may arbitrarily select one variable from a correlated group. SHAP values, while interpretable, can be computationally intensive and sensitive to model choice. Our intersection method provides a model-agnostic consensus, reducing dependency on any single algorithm’s output. Finally, we provide the Venn diagram to visually justify the selection, enhancing interpretability for clinicians.

The six predictors in our nomogram exhibit strong pathophysiological plausibility. APTT reflects intrinsic coagulation pathway activity; prolonged APTT may indicate impaired clot lysis or re-occlusion post-thrombolysis, increasing ENI risk. NLR is a well-established marker of systemic inflammation, which exacerbates blood–brain barrier disruption and cerebral edema after ischemic stroke. ONT is a critical determinant of tissue viability; delays beyond 4.5 h are associated with reduced reperfusion success and higher complication rates. ALT/AST ratio may reflect hepatic metabolic capacity and redox state, potentially influencing drug metabolism and oxidative stress. MCHC and weight may serve as proxies for nutritional status and comorbidity burden, which are known to affect stroke outcomes. Notably, we used the ALT/AST ratio rather than the more conventionally reported AST/ALT (De Ritis) ratio. While these ratios are mathematically reciprocal, their interpretability in predictive modeling differs. In our machine learning framework, the ALT/AST ratio demonstrated higher feature importance and better discrimination for ENI compared to the AST/ALT ratio. A lower ALT/AST ratio reflects relatively elevated AST levels, which may indicate subclinical hepatic dysfunction, increased oxidative stress, or systemic inflammation, the conditions known to impair neurovascular recovery after ischemic stroke (18, 19). Emerging evidence suggests that an elevated De Ritis ratio (low ALT/AST) is associated with increased infarct volume, hemorrhagic transformation, and poor functional outcomes in AIS (18, 19). This aligns with our finding that a lower ALT/AST ratio is negatively associated with ENI, reinforcing its role as a biomarker of metabolic vulnerability. Furthermore, ALT is predominantly expressed in hepatocytes, while AST is present in multiple tissues including brain, heart, and skeletal muscle; thus, a shift in this ratio may reflect multi-organ stress responses that modulate post-stroke recovery (20).

At present, the clinical application of this ML-based predictive model is substantial to the clinicians. It enables healthcare providers to identify AIS patients who are more likely to experience ENI after rt-PA treatment, thereby facilitating personalized care plans and timely interventions (21, 22). By leveraging the predictive power of the identified parameters, clinicians can optimize patient selection for thrombolysis, enhance monitoring strategies, and potentially improve outcomes. Moreover, the model’s ability to predict ENI may contribute to reducing the risk of adverse events and improving resource allocation in clinical settings (23, 24). Addition, the nomogram in our study could be used in a clinical setting to aid in decision-making and patient counseling. For example, clinicians can input this patient’s specific variables, such as APTT, ALT/AST, ONT, MCHC, Weight, and NLR, to generate a personalized probability of outcome. Suppose the nomogram-predicted risk is 75%. This high estimated risk may prompt earlier initiation of aggressive therapy or enrollment in a clinical trial, whereas a predicted risk of 20% might support a strategy of active surveillance. In patient counseling, this visual and quantitative tool can help clinicians clearly communicate individual risk, facilitating informed discussions about the potential benefits and harms of different management options. Future studies should focus on validating the model across diverse populations and integrating it into clinical decision support systems to maximize its utility in real-world practice.

Previously, there were studies using ML methods to construct predictive model of stroke outcomes (25, 26), but no study using ML methods to construct predictive model for ENI. In addition, although some previous studies have identified several clinical variables associated with ENI (27, 28), to knowledge, our study was firstly using multiple ML algorithms to construct predictive for ENI in AIS patients undergoing rt-PA treatment. In addition, our results revealed that this model showed a moderate predictive performance, which was more accuracy than previous studies that only present the variables associated with ENI. More importantly, unlike some expensive tests, the clinical indexes used to construct the model are common and cheap in clinical practice, thus it is easy for the clinical doctors to construct the predictive model, and it also did not add the addition burden on the patients. Finally, the nomogram enables the clinician to easy distinguish the patients with high risk of ENI. Therefore, our results hold promise for the precision medicine approaches in AIS patients undergoing rt-PA treatment.

To reduce the risk of overfitting of the model, our study used multiple, complementary strategies throughout the modeling pipeline to mitigate this risk. To reduce optimism in performance estimates, we implemented a strict train-test split (80%: 20%) and reported performance only on the held-out test set. The significant drop in AUC from training (XGBoost: 0.99) to testing (0.68) clearly indicates overfitting in some models, which is why we selected the MLP model (AUC 0.77), the most stable performer across training and test sets, as our primary ML model. In addition, rather than using all the variables in the final model, we drastically reduced dimensionality by selecting only 6 overlapping features from the top 20 of four diverse ML algorithms. This conservative selection was designed precisely to combat overfitting. Third, although we used a final logistic regression model, the intersection-based feature selection acts as a form of implicit regularization by selecting only features consistently ranked high across multiple algorithms, reducing the inclusion of spurious associations. Finally, the nomogram’s C-index (0.817) was bootstrap-corrected, meaning it was adjusted for overfitting using internal validation with 1,000 bootstrap resamples. This provides a more realistic estimate of model performance on new data.

In our study, the VIF value of the six parameters were all less than 2 (range: 1.28–1.45), indicating no significant multicollinearity. We note that although some original laboratory parameters (such as AST, ALT, NEU, LYM) are biologically related, our feature selection strategy prioritized composite indices (including ALT/AST ratio, NLR) over individual components, thereby reducing redundancy and enhancing model stability.

Nevertheless, our study also has many limitations. First, while our model demonstrates acceptable discrimination and calibration in internal validation, the retrospective design, single-center setting, and lack of external validation limit its generalizability. Second, the algorithm was built from the input features, and some hidden relationships may have been ignored because unknown or neglected features were not evaluated by physicians. Third, the patient’s long-term prognosis results were not collected. Fourth, the ML algorithms have its own limitation, which can suffer from overfitting, where models perform well on training data but fail to generalize to new, unseen data. Additionally, these methods often lack transparency, making it difficult to interpret the decision-making process, which can be a significant barrier in clinical applications where explainability is crucial (29, 30). Finally, despite our efforts to minimize overfitting, the relatively small sample size (n = 217) and high-dimensional feature space pose a risk of overfitting, a common challenge in clinical ML studies. Our findings require external validation in larger, multicenter cohorts to ensure generalizability. Therefore, future study is warranted to verify our results and address the above issues.

Conclusion

We developed and internally validated a machine learning-based nomogram that shows promising performance in predicting ENI after thrombolysis. The model, incorporating six clinically accessible variables, may serve as a potential tool to support clinical decision-making. However, Future research should focus on external validation and integration of this model into clinical practice to maximize its utility in clinical settings.

Data availability statement

The original contributions presented in the study are included in the article/supplementary material, further inquiries can be directed to the corresponding authors.

Ethics statement

The studies involving humans were approved by the Ethics Committee of the First Affiliated Hospital of Guangxi Medical University. The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent from the patients/participants or patients/participants' legal guardian/next of kin was not required to participate in this study in accordance with the national legislation and the institutional requirements.

Author contributions

B-HL: Methodology, Writing – original draft, Validation, Software, Data curation, Formal analysis, Visualization. H-wD: Visualization, Validation, Formal analysis, Writing – original draft, Methodology, Software, Data curation. Z-yQ: Formal analysis, Data curation, Writing – original draft, Visualization. N-qM: Funding acquisition, Writing – original draft, Data curation, Methodology. G-mW: Data curation, Writing – original draft, Software. R-TH: Funding acquisition, Data curation, Conceptualization, Writing – review & editing, Formal analysis. CQ: Funding acquisition, Conceptualization, Writing – review & editing, Data curation.

Funding

The author(s) declare that financial support was received for the research and/or publication of this article. This study was partially supported by research funding from the National Natural Science Foundation of China (No. 81860222; 82060226) and Scientific Research Team Incubation Project of Guangxi Minzu Hospital (No. FY202107), Joint Project on Regional High-Incidence Diseases Research of Guangxi Natural Science Foundation (No. 2024JJD140151, 2024GXNSFBA010079), and the Self-Financed Scientific Research Projects of Guangxi Autonomous Region Health and Wellness Commission (Z-A20230506, Z-A20230498).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The authors declare that no Gen AI was used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

1. G.B.D.N. Collaborators. Global, regional, and national burden of neurological disorders, 1990-2016: a systematic analysis for the global burden of disease study 2016. Lancet Neurol. (2019) 18:459–80. doi: 10.1016/S1474-4422(18)30499-X

PubMed Abstract | Crossref Full Text | Google Scholar

2. El-Hajj, VG, Daller, C, Fletcher-Sandersjoo, A, Gharios, M, Bydon, M, Soderman, M, et al. The negative impact of treatment delays on the long-term neurological outcomes of spinal dural arteriovenous fistulas: a longitudinal cohort study. Neurosurg Focus. (2024) 56:E14. doi: 10.3171/2023.12.FOCUS23703

PubMed Abstract | Crossref Full Text | Google Scholar

3. Wang, M, Yan, H, Zhang, Y, Zhou, Q, Meng, X, Lin, J, et al. Accelerated biological aging increases the risk of short- and long-term stroke prognosis in patients with ischemic stroke or TIA. EBioMedicine. (2024) 111:105494. doi: 10.1016/j.ebiom.2024.105494

Crossref Full Text | Google Scholar

4. Marko, M, Posekany, A, Szabo, S, Scharer, S, Kiechl, S, Knoflach, M, et al. Trends of r-tPA (recombinant tissue-type plasminogen activator) treatment and treatment-influencing factors in acute ischemic stroke. Stroke. (2020) 51:1240–7. doi: 10.1161/STROKEAHA.119.027921

Crossref Full Text | Google Scholar

5. Seners, P, Turc, G, Oppenheim, C, and Baron, JC. Incidence, causes and predictors of neurological deterioration occurring within 24 h following acute ischaemic stroke: a systematic review with pathophysiological implications. J Neurol Neurosurg Psychiatry. (2015) 86:87–94. doi: 10.1136/jnnp-2014-308327

PubMed Abstract | Crossref Full Text | Google Scholar

6. Jin, H, Bi, R, Hu, J, Xu, D, Su, Y, Huang, M, et al. Elevated serum lactate dehydrogenase predicts unfavorable outcomes after rt-PA thrombolysis in ischemic stroke patients. Front Neurol. (2022) 13:816216. doi: 10.3389/fneur.2022.816216

PubMed Abstract | Crossref Full Text | Google Scholar

7. Fu, M, Liu, Y, Hou, Z, and Wang, Z. Interpretable prediction of acute ischemic stroke after hip fracture in patients 65 years and older based on machine learning and SHAP. Arch Gerontol Geriatr. (2025) 129:105641. doi: 10.1016/j.archger.2024.105641

PubMed Abstract | Crossref Full Text | Google Scholar

8. Wen, J, Zhang, T, Ye, S, Li, C, Han, R, Huang, R, et al. Development of transient ischemic attack risk prediction model suitable for initializing a learning health system unit using electronic medical records. BMC Med Inform Decis Mak. (2024) 24:392. doi: 10.1186/s12911-024-02767-x

PubMed Abstract | Crossref Full Text | Google Scholar

9. Da Ros, V, Duggento, A, Cavallo, AU, Bellini, L, Pitocchi, F, Toschi, N, et al. Can machine learning of post-procedural cone-beam CT images in acute ischemic stroke improve the detection of 24-h hemorrhagic transformation? A preliminary study. Neuroradiology. (2023) 65:599–608. doi: 10.1007/s00234-022-03070-0

Crossref Full Text | Google Scholar

10. Jiang, Y, Zhao, Q, Guan, J, Wang, Y, Chen, J, and Li, Y. Analyzing prehospital delays in recurrent acute ischemic stroke: insights from interpretable machine learning. Patient Educ Couns. (2024) 123:108228. doi: 10.1016/j.pec.2024.108228

PubMed Abstract | Crossref Full Text | Google Scholar

11. Petrovic, I, Broggi, S, Killer-Oberpfalzer, M, Pfaff, JAR, Griessenauer, CJ, Milosavljevic, I, et al. Predictors of in-hospital mortality after thrombectomy in anterior circulation large vessel occlusion: a retrospective, machine learning study. Diagnostics. (2024) 14:1531. doi: 10.3390/diagnostics14141531

Crossref Full Text | Google Scholar

12. Wen, R, Wang, M, Bian, W, Zhu, H, Xiao, Y, Zeng, J, et al. Machine learning-based prediction of early neurological deterioration after intravenous thrombolysis for stroke: insights from a large multicenter study. Front Neurol. (2024) 15:1408457. doi: 10.3389/fneur.2024.1408457

PubMed Abstract | Crossref Full Text | Google Scholar

13. Fan, K, Cao, W, Chang, H, and Tian, F. Predicting prognosis in patients with stroke treated with intravenous alteplase through blood pressure changes: a machine learning-based approach. J Clin Hypertens (Greenwich). (2023) 25:1009–18. doi: 10.1111/jch.14732

PubMed Abstract | Crossref Full Text | Google Scholar

14. Annus, A, Tomosi, F, Rarosi, F, Feher, E, Janaky, T, Kecskemeti, G, et al. Kynurenic acid and kynurenine aminotransferase are potential biomarkers of early neurological improvement after thrombolytic therapy: a pilot study. Adv Clin Exp Med. (2021) 30:1225–32. doi: 10.17219/acem/141646

PubMed Abstract | Crossref Full Text | Google Scholar

15. Lai, Y, Diana, F, Mofatteh, M, Nguyen, TN, Jou, E, Zhou, S, et al. Predictors of failure of early neurological improvement in early time window following endovascular thrombectomy: a multi-center study. Front Neurol. (2023) 14:1227825. doi: 10.3389/fneur.2023.1227825

PubMed Abstract | Crossref Full Text | Google Scholar

16. Xiufu, Z, Ruipeng, L, Jun, Z, Yonglong, L, Yulin, W, Jian, Z, et al. Analysis of influencing factors of early neurological improvement after intravenous rt-PA thrombolysis in acute anterior circulation ischemic stroke. Front Neurol. (2022) 13:1037663. doi: 10.3389/fneur.2022.1037663

PubMed Abstract | Crossref Full Text | Google Scholar

17. Gong, PY, Liu, YK, Gong, YC, Chen, G, Zhang, XH, Wang, SY, et al. The association of neutrophil to lymphocyte ratio, platelet to lymphocyte ratio, and lymphocyte to monocyte ratio with post-thrombolysis early neurological outcomes in patients with acute ischemic stroke. J Neuroinflammation. (2021) 18:51–62. doi: 10.1186/s12974-021-02090-6

Crossref Full Text | Google Scholar

18. Ahmadabad, MA, Naeimi, A, Keymoradzadeh, A, Faghani, S, Ahmadabad, MA, Boroujeni, NA, et al. Evaluation of De Ritis (AST/ALT), ALP/ALT, and AST/ALP ratios as prognostic factors in patients with acute ischemic stroke. BMC Neurol. (2022) 22:450. doi: 10.1186/s12883-022-02989-4

PubMed Abstract | Crossref Full Text | Google Scholar

19. Gao, F, Chen, C, Lu, J, Zheng, J, Ma, XC, Yuan, XY, et al. De Ritis ratio (AST/ALT) as an independent predictor of poor outcome in patients with acute ischemic stroke. Neuropsychiatr Dis Treat. (2017) 13:1551–7. doi: 10.2147/NDT.S139316

PubMed Abstract | Crossref Full Text | Google Scholar

20. Lai, X, Chen, H, Dong, X, Zhou, G, Liang, D, Xu, F, et al. AST to ALT ratio as a prospective risk predictor for liver cirrhosis in patients with chronic HBV infection. Eur J Gastroenterol Hepatol. (2024) 36:338–44. doi: 10.1097/MEG.0000000000002708

PubMed Abstract | Crossref Full Text | Google Scholar

21. Venkataram, T, Kashyap, S, Harikar, MM, Inserra, F, Barone, F, Travali, M, et al. The application of machine learning for treatment selection of unruptured brain arteriovenous malformations: a secondary analysis of the ARUBA trial data. Clin Neurol Neurosurg. (2024) 249:108681. doi: 10.1016/j.clineuro.2024.108681

Crossref Full Text | Google Scholar

22. Qiao, X, Lu, C, Xu, M, Yang, G, Chen, W, and Liu, Z. DeepSAP: a novel brain image-based deep learning model for predicting stroke-associated pneumonia from spontaneous intracerebral hemorrhage. Acad Radiol. (2024) 31:5193–203. doi: 10.1016/j.acra.2024.06.025

PubMed Abstract | Crossref Full Text | Google Scholar

23. Tian, C, Ji, Z, Xiang, W, Huang, X, Wang, S, Wu, Y, et al. Association of lower leukocyte count before thrombolysis with early neurological improvement in acute ischemic stroke patients. J Clin Neurosci. (2018) 56:44–9. doi: 10.1016/j.jocn.2018.08.004

PubMed Abstract | Crossref Full Text | Google Scholar

24. Agarwal, S, Cutting, S, Grory, BM, Burton, T, Jayaraman, M, McTaggart, R, et al. Redefining early neurological improvement after reperfusion therapy in stroke. J Stroke Cerebrovasc Dis. (2020) 29:104526. doi: 10.1016/j.jstrokecerebrovasdis.2019.104526

PubMed Abstract | Crossref Full Text | Google Scholar

25. Yao, Z, Ji, Q, Zang, X, Yun, W, Luo, Y, Cao, J, et al. Explainable machine learning model for predicting functional outcomes in posterior circulation stroke after thrombectomy. J Neurointerv Surg. (2025):jnis-2025-023624. doi: 10.1136/jnis-2025-023624

PubMed Abstract | Crossref Full Text | Google Scholar

26. Ma, L, Ji, L, Cheng, Z, Geng, X, and Ding, Y. Developing an explainable prognostic model for acute ischemic stroke: combining clinical and inflammatory biomarkers with machine learning. Brain Behav. (2025) 15:e70673. doi: 10.1002/brb3.70673

PubMed Abstract | Crossref Full Text | Google Scholar

27. Li, L, Han, Z, Wang, R, Fan, J, Zheng, Y, Huang, Y, et al. Association of admission neutrophil serine proteinases levels with the outcomes of acute ischemic stroke: a prospective cohort study. J Neuroinflammation. (2023) 20:70. doi: 10.1186/s12974-023-02758-1

PubMed Abstract | Crossref Full Text | Google Scholar

28. Heinze, M, Cheng, B, Cho, TH, Ebinger, M, Endres, M, Fiebach, JB, et al. Predictors of early neurological improvement and its relationship to thrombolysis treatment and long-term outcome in the WAKE-UP study. Cerebrovasc Dis. (2023) 52:560–6. doi: 10.1159/000528805

PubMed Abstract | Crossref Full Text | Google Scholar

29. Panch, T, Mattie, H, and Atun, R. Artificial intelligence and algorithmic bias: implications for health systems. J Glob Health. (2019) 9:010318. doi: 10.7189/jogh.09.020318

PubMed Abstract | Crossref Full Text | Google Scholar

30. Ed-Driouch, C, Mars, F, Gourraud, PA, and Dumas, C. Addressing the challenges and barriers to the integration of machine learning into clinical practice: an innovative method to hybrid human-machine intelligence. Sensors. (2022) 22:8313. doi: 10.3390/s22218313

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: early neurological improvement, predictive model, machine learning algorithms, acute ischemic stroke, intravenous thrombolysis

Citation: Lv B-H, Deng H-w, Qin Z-y, Meng N-q,Weng G-m, Hu R-T and Qin C (2025) A machine learning-based predictive nomogram for early neurological improvement after thrombolysis in acute ischemic stroke. Front. Neurol. 16:1662498. doi: 10.3389/fneur.2025.1662498

Received: 09 July 2025; Accepted: 27 October 2025;
Published: 12 November 2025.

Edited by:

Ming Wei, Tianjin Huanhu Hospital, China

Reviewed by:

Wenbo Zhao, Capital Medical University, China
Shan Lv, Capital Medical University, China

Copyright © 2025 Lv, Deng, Qin, Meng, Weng, Hu and Qin. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Rui-Ting Hu, MjE0ODQ2NjQyQHFxLmNvbQ==; Chao Qin, bWRxYzIwMTlAMTI2LmNvbQ==

^†These authors have contributed equally to this work

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.