- 1Department of Neurology, The First Clinical College, Xuzhou Medical University, Xuzhou, China
- 2Department of Neurology, Affiliated Hospital of Xuzhou Medical University, Xuzhou, China
- 3Department of Anesthesiology, The First Affiliated Hospital of Xi'an Jiaotong University, Xi'an, China
- 4Department of Neurology, Hongze District People's Hospital, Huaian, China
Introduction: Acute ischemic stroke (AIS) patients often experience poor functional outcomes post-intravenous thrombolysis (IVT). Novel computational methods leveraging machine learning (ML) architectures increasingly support medical decision-making. We aimed to develop and validate a machine learning model to predict 3-month unfavorable functional outcome after IVT in AIS patients.
Methods: This retrospective study developed ML prognostic models for 3-month functional outcome (modified Rankin scale scores of 3–6) in IVT-treated AIS patients. A derivation cohort (n = 938) was split 7:3 for training/testing, with an independent external validation cohort (n = 324). The least absolute shrinkage and selection operator (LASSO) regression selected predictors from clinical/neuroimaging/laboratory variables. Eight ML algorithms (including Logistic Regression, Random Forest, Extreme Gradient Boosting, Multilayer Perceptron, Support Vector Machine, Light Gradient Boosting Machine, Decision Tree, and K-Nearest Neighbors) were trained using 10-fold cross-validation and evaluated on test/external sets via the area under the curve (AUC), accuracy, precision, recall and F1-score. Additionally, the SHapley Additive exPlanations (SHAP) interpreted the optimal model.
Results: 938 patients constituted the derivation cohort (training: n = 656, test: n = 282) and 324 patients the external validation cohort. Unfavorable 3-month outcomes (mRS 3–6) occurred in 25.7% and 22.8%, respectively. LASSO regression selected five predictors: the neutrophil-to-lymphocyte ratio (NLR), admission National Institutes of Health Stroke Scale (NIHSS) score, the Alberta Stroke Program Early CT Score (ASPECTS), atrial fibrillation, and blood glucose. While tree-based methods like XGBoost and LightGBM showed elevated training performance (e.g., XGBoost training AUC = 0.878) but significant drops in validation (AUC = 0.791), LR demonstrated optimal performance: robust training AUC (0.792), minimal validation degradation (AUC = 0.787). LR model was subsequently employed as classification method demonstrating optimal performance with (AUC = 0.777) in the test dataset. External validation confirmed LR’s stability (AUC = 0.797). SHAP analysis ranked NLR as the strongest predictor (followed by NIHSS/ASPECTS), with higher values increasing risk. Learning curves indicated no overfitting. A nomogram enabled individualized risk quantification.
Conclusion: A parsimonious 5-variable LR model robustly predicts 3-month post-IVT outcomes, combining clinical utility, interpretability, and generalizability. NLR-driven inflammation is critical to prognosis. This tool facilitates early high-risk patient identification for personalized intervention.
1 Introduction
Stroke remains as a global health crisis, ranking as the second leading contributor to mortality worldwide and the third leading cause of long-term disability (1). It imposes a substantial global health burden at both individual and societal levels, with the rate of disability burden increasing more rapidly in low-income and middle-income countries than in high-income countries (2–4). Acute ischemic stroke (AIS) is defined as sudden neurological dysfunction caused by focal brain ischemia lasting more than 24 h or accompanied by evidence of acute infarction on brain imaging, regardless of symptom duration, accounts for approximately 70% of incident stroke events (5, 6). Intravenous thrombolysis (IVT), administered within the 4.5-h time window, constitutes the gold-standard therapy for AIS, as universally endorsed by international guidelines (7). Despite advancements in endovascular thrombectomy, IVT remains the most accessible and efficacious reperfusion treatment for patients with AIS in clinical practice, owing to its widespread availability and relative simplicity of administration (8, 9). Despite its established efficacy in enhancing functional recovery, nearly half of IVT-treated patients experience unfavorable functional outcomes at 3 months. The modified Rankin Scale (mRS; range 0–6, where 6 indicates death), which integrates both motor and cognitive components and encompasses the constructs of impairment, disability, and handicap, is considered to be the most accepted outcome for assessing the efficacy of interventions of AIS (10, 11). Given the substantial neurological disability burden associated with AIS (12), developing validated predictive tools remains imperative for the early identification of patients susceptible to adverse functional outcomes. Such prognostic stratification would facilitate targeted interventions and optimized resource allocation, ultimately improving long-term neurological prognosis. However, many existing prediction models are limited by their suboptimal predictive accuracy and the lack of robust external validation, resulting in uncertain generalizability to broader, more diverse populations (13, 14). Furthermore, numerous tools rely on high-dimensional data—incorporating extensive imaging, genomic, or biomarker variables—which complicates clinical interpretation and practical implementation, thereby hindering widespread adoption (15, 16). The development of novel, concise, yet robust prediction tools is therefore essential to enhance clinical relevance and facilitate translation into routine care.
Inflammation and immune responses critically mediate all phases of cerebral ischemia pathogenesis. Following ischemic insult, the inflammatory response initiated promptly. Focal brain ischemia stimulates what is called sterile inflammation (17), trigger inflammatory signaling through the activation of microglia, which subsequently release pro-inflammatory cytokines and chemokines, thereby promoting robust pro-inflammatory cascades, propelling the pathophysiological progression (18, 19). Critically, ischemic microenvironments trigger local immune responses, characterized by inflammatory cytokine production, which exacerbate blood–brain barrier (BBB) permeability (20, 21). Notably, neutrophils are the earliest leukocytes recruited from peripheral blood into the brain (22, 23). Neutrophils induce neurotoxicity through multiple mechanisms such as the participation in thrombus formation and expansion, upregulation of matrix metalloproteinases, excessive generation of reactive oxygen species, and the release of neutrophil extracellular traps (NETs) (24–26). The subsequent increase in capillary permeability, disruption of the BBB, and cellular edema can collectively impair post-stroke revascularization and vascular remodeling, thereby adversely affecting stroke outcomes (27). Clinical studies demonstrated the early increase of peripheral neutrophils as an independent predictor of neurological deterioration and poor outcome (28, 29). In addition, acute central nervous system injury can induce a state of immunodepression by activating the sympathetic nervous system and hypothalamic–pituitary–adrenal axis, leading to elevated catecholamines and steroids that cause apoptosis and functional deactivation of peripheral lymphocytes (30). Lymphocytes serve as pivotal regulators of host defense, and their depletion markedly elevates susceptibility to infections. Clinical research data indicates that low lymphocyte counts constitute an independent predictor of infection risk in stroke patients (31, 32). Emerging evidence underscores the prognostic significance of these mechanisms of leukocyte-derived inflammation in post-stroke outcomes (27), with the neutrophil-to-lymphocyte ratio (NLR) validated as a predictive biomarker for clinical outcome in AIS patients receiving IVT (33). While baseline NLR has been established as an independent risk factor for outcomes including early neurological improvement (ENI), hemorrhagic transformation (HT), and mortality in AIS patients (34), the predominant focus of current NLR research on univariate assessments fails to capture synergistic interactions with clinical covariates (35). This methodological constraint impedes clinical translation, given that isolated biomarkers inherently lack the discriminative power for complex multifactorial outcomes.
Machine learning (ML), a rapidly advancing branch of artificial intelligence (AI), leveraging computational advances to uncover predictive insights from high-dimensional data, demonstrates growing utility in clinical stroke research (36, 37). ML offers substantial advantages in predictive accuracy and in identifying previously overlooked patient subgroups defined by unique physiological characteristics and prognostic trajectories. Various methodologies exist for feature selection within the domain of ML. Notably, the least absolute shrinkage and selection operator (LASSO) regression distinguishes itself from conventional stepwise regression techniques, which utilize forward or backward variable selection, by facilitating the effective screening of a greater number of variables even when the sample size is limited (38). Moreover, LASSO regression provides superior feature selection from high-dimensional biomedical datasets while addressing multicollinearity limitations inherent in conventional methods (39). As a result, LASSO-based ML methods demonstrate enhanced prognostic discrimination across diverse medical applications (40–42). Furthermore, to compensate for the scarcity of interpretable evidence supporting predictive models, we deployed the SHapley Additive exPlanations (SHAP) analysis. This technique offers intuitive, feature-level explanations, which are critical for validating model efficacy and building trust (43). Consequently, integrating complementary clinical variables using ML models and SHAP interpretation may optimize the prediction of unfavorable outcomes for post-IVT AIS patients.
Therefore, we aimed to develop and validate a machine learning model for predicting 3-month functional outcomes in IVT-treated AIS patients, incorporating interpretability analysis to elucidate predictor contributions to the model predictions.
2 Materials and methods
2.1 Study population
This retrospective study enrolled patients diagnosed with AIS who received IVT within the 4.5-h treatment window. The derivation cohort consisted of 938 patients treated at The Affiliated Hospital of Xuzhou Medical University between September 2020 and October 2024. Admission non-contrast head computed tomography (CT) confirmed the absence of acute hemorrhage. An independent external validation cohort comprised 324 consecutive patients treated with IVT for AIS at Hongze District People’s Hospital between January 2019 and December 2022. Identical inclusion and exclusion criteria were applied to both cohorts. Inclusion criteria were: (1) over 18 years of age; (2) clinically and neuroimaging-confirmed diagnosis of AIS; (3) within 4.5 h of symptom onset, followed by recombinant tissue plasminogen activator (rt-PA) treatment (0.9 mg/kg up to a maximum of 90 mg, 10% of the dose as a bolus followed by a 60-min infusion of the remaining dose). Exclusion criteria were: (1) pre-stroke modified Rankin Scale (mRS) scores > 2, indicating significant pre-existing disability; (2) unavailable 3-month post-stroke mRS assessment; (3) receipt of subsequent endovascular thrombectomy; (4) active malignancy or major trauma at admission; (5) incomplete clinical data. To assess potential selection bias, we compared patients excluded due to missing data with the final derivation cohort across baseline characteristics. The study protocol received approval from the Ethics Committee of The Affiliated Hospital of Xuzhou Medical University (Approval number: XYFY2025-KL044-01). Given its retrospective design using anonymized data, the requirement for written informed consent was waived.
2.2 Data collection
The analysis incorporated these clinical variables: (1) Demographics: age, sex and body mass index (BMI). (2) Medical history: hypertension, diabetes mellitus (DM), coronary heart disease (CHD), atrial fibrillation (AF), anticoagulant/antiplatelet medications, smoking status, and alcohol consumption; (3) Clinical features: admission systolic/diastolic blood pressure (SBP/DBP), onset-to-treatment time (OTT), National Institutes of Health Stroke Scale (NIHSS) score at admission and discharge, Trial of ORG 10172 in Acute Stroke Treatment (TOAST) classification, Alberta Stroke Program Early CT Score (ASPECTS) (44), mRS score at 3 months; (4) Laboratory indices: admission levels of neutrophil, lymphocyte, platelet, eosinophil counts, blood glucose, albumin, and glycated hemoglobin (HbA1c). The neutrophil-to-lymphocyte ratio (NLR) was calculated at admission by dividing absolute neutrophil count by absolute lymphocyte count.
2.3 Outcome definition
Functional outcome was assessed using the mRS 3 months after IVT. Evaluations were performed during scheduled clinic visits by board-certified neurologists blinded to the predictive model development. For patients unable to attend clinic, structured telephone interviews were conducted by trained research nurses using a validated protocol to ensure reliable mRS scoring. The primary outcome was unfavorable functional outcome, defined as mRS score 3–6. A favorable outcome was defined as mRS score 0–2 (45).
2.4 Feature selection
Feature selection was performed using the Least Absolute Shrinkage and Selection Operator (LASSO) regression (46). This regularization technique minimizes overfitting by applying an L1 penalty term that shrinks coefficients and drives some coefficients of non-informative features to zero. Continuous variables were standardized (mean = 0, standard deviation = 1) before model fitting to ensure equivalent scaling of the penalty term. LASSO regression was performed on the derivation cohort training set (70% of derivation cohort). Feature subset optimization against overfitting was achieved by determining the optimal regularization parameter (λ) value through the standard error of the minimum distance based 10-fold cross-validation (47). Features with non-zero coefficients after LASSO regularization were retained for subsequent modeling.
2.5 Machine learning model development and evaluation
2.5.1 Model development
Eight supervised machine learning algorithms were trained to predict the 3-month unfavorable functional outcome using the features selected by LASSO: Logistic Regression (LR), Random Forest (RF), Extreme Gradient Boosting (XGBoost), Multilayer Perceptron (MLP), Support Vector Machine (SVM), Light Gradient Boosting Machine (LightGBM), Decision Tree (DT), and K-Nearest Neighbors (KNN). Models were implemented using Python libraries (scikit-learn 0.22.1, XGBoost 1.2.1, LightGBM 3.2.1). The derivation cohort was randomly stratified by outcome and split into a training set (70%) and a held-out internal test set (30%). Hyperparameter tuning for each algorithm was performed exclusively on the training set using a nested 10-fold cross-validation strategy. The inner loop of the cross-validation was optimized by maximizing the Area Under the Receiver Operating Characteristic Curve (ROC-AUC). The internal test set was used only once for the final comparative evaluation of all tuned models.
We implemented a comprehensive tuning strategy using grid search with cross-validation. For tree-based models (XGBoost, LightGBM, Random Forest, Decision Tree), we focused on regularization parameters including max_depth, min_samples_split, and reg_lambda to control model complexity and prevent overfitting. For linear models (Logistic Regression, SVM), we optimized regularization strength through the C parameter. All preprocessing steps were fitted solely on the training folds of the inner loop to prevent any data leakage. The optimized hyperparameters from the inner loop were then used to train a final model on the entire training set for evaluation on the held-out internal test set.
2.5.2 Model evaluation and comparison
Model performance was assessed using: (1) Discrimination: Primary metric: Area Under the Receiver Operating Characteristic Curve (ROC-AUC). Secondary metrics: Accuracy, Precision, Recall, F1-Score. Optimal classification thresholds were determined by maximizing the Youden Index on the validation folds. ROC curves and AUC values were generated for all datasets: internal training (using cross-validation predictions), internal test set, and external validation set. (2) Calibration: Calibration curves plotted predicted probabilities against observed event frequencies (Python, sklearn 0.22.1). Perfect calibration demonstrates along the 45° line. The Brier score was also reported (lower score indicates better calibration, range 0–1). (3) Clinical Utility: Decision Curve Analysis (DCA) implemented in R software (rmda 1.6) assessed the net benefit across a range of probability thresholds (15–35%) relevant for clinical decision-making. The performance metrics on the internal test set were compared across all eight algorithms to identify the optimal predictive model.
2.5.3 Model interpretation
The SHapley Additive exPlanations (SHAP) method (Python SHAP v0.39.0) was applied to the selected optimal model for interpretability (48). SHAP values attribute a contribution value to each feature for each individual prediction, enabling local and global interpretability. Graphical depiction techniques included: (1) Summary plots identifying the five most influential covariates through value magnitude visualization; (2) Dependency plots elucidating marginal effect relationships between feature variations and Shapley value fluctuations; (3) Global feature importance analysis combined with partial dependence evaluations. This integrated approach delineates directional associations between explanatory variables and adverse outcome predictions.
2.5.4 External validation
The generalizability of the final optimized model was evaluated by applying the parameters trained on the full derivation cohort training set to the independent, prospectively collected external validation cohort from Hongze District People’s Hospital. AUC, sensitivity and specificity were computed.
2.6 Statistical analysis
Statistical analysis was conducted using SPSS (Statistical Package for the Social Sciences, v26.0), R (v4.2.3), and Python (v3.11.4). Continuous variables were summarized as mean ± standard deviation (SD) or medians (IQRs), with group comparisons conducted using the Mann–Whitney U test. Categorical variables were reported as frequency percentages (%), analyzed through Pearson’s χ2 or Fisher’s exact tests. Independent predictor capacity was expressed through odds ratios (95% confidence intervals). All statistical tests were two-tailed, adopting p-value < 0.05 as the significance statistically.
3 Results
3.1 Baseline characteristics
This research initially enrolled 1,529 patients diagnosed with AIS and received IVT within 4.5 h of symptom onset. After applying the exclusion criteria, the final derivation cohort consisted of 938 patients (Figure 1). Exclusions included: endovascular therapy (n = 173), pre-stroke mRS > 2 (n = 55), missing data (n = 162), concurrent malignancy or major trauma (n = 35), and loss to 3-month follow-up (n = 166). Patients missing essential record required for model development (n = 162) were excluded. To assess potential selection bias, we compared these excluded patients with the included derivation cohort (n = 938) across baseline characteristics including demographics, clinical features, and laboratory indices. No significant differences were observed in any variable (all p-values > 0.05; Supplementary Table 1), indicating comparable profiles between groups. This supports the representativeness of the analyzed cohort despite missing data handling via complete-case analysis. The derivation cohort was randomly split into a training set (70%, n = 656) and an internal test set (30%, n = 282). Baseline characteristics did not differ significantly (p > 0.05) between the training and internal test sets (Supplementary Table 2), confirming successful randomization and mitigating selection bias.

Figure 1. Flowchart of patient selection. AIS, acute ischemic stroke; IVT, intravenous thrombolysis; mRS score, modified Rankin Scale.
The overall derivation cohort (n = 938) had a mean age of 68 years (range 59–77), with males comprising 65.9%. Table 1 details clinical profiles stratified by 3-month functional outcome (favorable with mRS 0–2 vs. unfavorable with mRS 3–6). Unfavorable outcomes occurred in 241 patients (25.7%). The external validation set (n = 324) showed a comparable unfavorable outcome rate of 22.8% (n = 74). Reduced functional recovery significantly correlated with multiple clinical indicators including: advanced age, atrial fibrillation, smoking/drinking history, anticoagulant therapy, admission SBP, onset-to-treatment time, ASPECTS, baseline NIHSS, TOAST classification, NLR, platelet count, eosinophil level, albumin, RDW, HDL, and glucose levels (p < 0.05; Table 1).
3.2 Feature selection for outcome prediction
The selection of predictive features was performed using Least Absolute Shrinkage and Selection Operator (LASSO) regression, a penalized regression technique designed to handle multicollinearity and prevent overfitting by shrinking the coefficients of non-informative variables to zero. An initial pool of 17 clinically accessible variables, encompassing demographics, medical history, clinical presentation, imaging features, and laboratory indices, was standardized and entered into the model. The optimal regularization parameter (λ) was determined via 10-fold cross-validation on the derivation training set (n = 656), minimizing the binomial deviance. This process identified the optimal λ parameter (lambda with minimum distance = 0.039; Figure 2), which addresses multicollinearity and overfitting through coefficient shrinkage (49). At this optimal λ, the model retained five variables with non-zero coefficients: NLR, baseline NIHSS, ASPECTS, atrial fibrillation, and blood glucose. Subsequently, multivariate logistic regression analysis confirmed that each of these five variables was independently associated with an increased risk of unfavorable outcome (p < 0.05; Table 2) (50). Odds ratios (OR) with 95% confidence intervals (CI) are reported in Table 2, and the LASSO coefficients are provided in Supplementary Table 3.

Figure 2. Feature selection based on LASSO model. (A) Coefficient profiles during LASSO regularization. (B) Ten-fold cross-validation for identifying the optimal λ value.
3.3 Models performance comparisons
Eight machine learning models (XGBoost, LR, LightGBM, RF, DT, MLP, SVM, KNN) were trained and tuned using the five selected predictors on the derivation training set via 10-fold cross-validation. Performance was independently evaluated on the held-out internal test set. Our optimization encompassed tuning of core hyperparameters across all model types. For tree-based models (e.g., XGBoost, LightGBM), we focused on parameters controlling tree complexity (e.g., depth constraints, leaf node quantities), regularization intensity (e.g., L1/L2 coefficients), and key overfitting prevention mechanisms. For linear models (e.g., Logistic Regression, SVM), we systematically optimized regularization types and strengths. All parameters were determined through comprehensive grid search with cross-validation.
Comprehensive performance evaluation revealed distinct characteristics among the models. As shown in Figure 3A, all models demonstrated reasonable training performance without perfect discrimination (AUC range: 0.708–0.878), indicating successful mitigation of overfitting through our optimized regularization approach. While tree-based methods like XGBoost and LightGBM showed elevated training performance (e.g., XGBoost training AUC = 0.878; Figure 3A) but significant drops in validation (AUC = 0.791; Figure 3B), Logistic Regression achieved stable cross-validated performance (AUC = 0.792, 95% CI: 0.754–0.829) that was maintained on the validation set (AUC = 0.787, 95% CI: 0.673–0.900) with minimal performance degradation (ΔAUC = 0.005) (Figure 3B). Critical assessment of model calibration demonstrated LR’s superior reliability with optimal Brier score (0.140, 95% CI: 0.131–0.149), significantly outperforming other models in calibration accuracy (Figure 3C). Decision Curve Analysis confirmed LR’s clinical utility, providing the greatest net benefit across clinically relevant threshold probabilities (15–35%) (Figure 3D). Precision-Recall analysis further supported LR’s robust performance (AP = 0.656) in handling class imbalance (Figures 3E,F). Accuracy, Precision, Recall, F1-Score and Cutoff value for all models are detailed in Table 3. Collectively, based on its optimal balance of discriminatory ability, calibration reliability, and clinical utility across multiple metrics, Logistic Regression was selected as the preferred model for deployment.

Figure 3. Comprehensive analysis of ML algorithms. (A) ROC curves comparing discrimination performance of eight ML models for predicting post-thrombolysis outcomes in the training cohort. (B) ROC curves demonstrating generalization performance in validation cohort. (C) Calibration curves depicting agreement between predicted probabilities (x-axis) and observed event frequencies (y-axis). Logistic Regression showed optimal calibration. (D) Decision Curve Analysis (DCA) assessing net benefit across probability thresholds. (E) Training set PR curve. (F) Test set PR curve. Precision-recall relationships at varying probability thresholds. Horizontal dashed line indicates positive event rate.
3.4 Development and validation of the optimal model
The optimal Logistic Regression model, using the five selected predictors (AF, ASPECTS, NIHSS, Blood Glucose, NLR), was refined on the entire derivation training set (n = 656) using 10-fold cross-validation. The mean cross-validated AUC was 0.794 (95% CI: 0.749–0.838; range across folds: 0.728–0.851; Figure 4A). Validation set performance maintained stability (mean AUC 0.788, 95% CI: 0.655–0.920) (Figure 4B). Evaluation on the internal test set (n = 282) demonstrated sustained performance (AUC = 0.777, 95% CI: 0.710–0.844; Figure 4C). Accuracy, specificity, and sensitivity on the test set were 0.791, 0.860, and 0.567, respectively. The learning curve analysis indicated stable model performance, with converging training and validation accuracy estimates remaining above 80% without significant divergence as the sample size increased, suggesting the model was adequately fitted without overfitting (Figure 4D). Calibration on the test set was moderate (Figure 4E; Brier Score = 0.140). Decision Curve Analysis confirmed positive net benefit across the same relevant probability thresholds (Figure 4F).

Figure 4. Logistic regression model development encompassed training cohort modeling, cross-validation refinement, and independent testing cohort verification. (A) Training set ROC analysis with 10-fold cross-validation. (B) Validation set ROC performance across 10 folds. (C) Test set discrimination performance (AUC 0.777, 95% CI: 0.710–0.844). Blue: model performance; red dashed: random classifier. (D) Learning curve: AUC by training sample size. Validation (blue dashed) converges with training (red) beyond 300 samples, indicating minimal overfitting. (E) Calibration curve showing agreement between predicted probabilities and observed outcomes. (F) Decision curve analysis: Logistic model (red solid) shows superior net benefit versus treat-all (black dashed) and treat-none (red dotted) strategies, particularly at 20–50% thresholds. Peak net benefit (0.17) occurs at 30% risk threshold.
3.5 Optimal model interpretation
SHapley Additive exPlanations (SHAP) analysis was employed to interpret the final Logistic Regression model globally and locally. The SHAP summary plot (Figure 5A) illustrates the impact and directionality of each predictor. Higher NLR values and NIHSS scores consistently increased the risk of an unfavorable outcome, while higher ASPECTS scores decreased the risk. Presence of AF and higher blood glucose levels also generally increased the risk. NLR exhibited predominant risk-enhancing properties, where elevated values substantially increasing adverse outcome risk. Mean absolute SHAP value analysis ranked the features by their overall contribution to the model’s predictions: NLR was the most influential predictor, followed by NIHSS, ASPECTS, Atrial Fibrillation and Blood Glucose (Figure 5B). Examples of local interpretability are shown in Figure 5C (patient predicted low risk, actual favorable outcome) and Figure 5D (patient predicted high risk, actual unfavorable outcome). Each feature’s specific value and its SHAP contribution (increasing or decreasing the predicted probability away from the base value) combine to generate the individual prediction.

Figure 5. Feature importance and interpretation. (A) SHAP summary plot: Directional impact of predictors on outcome risk. Point position indicates effect direction—positive values denote enhanced risk probability, while negative counterparts indicate protective functions. (B) Quantitative comparison of predictive influence magnitude. Error bars represent standard error of mean absolute SHAP values. (C) SHAP explanation for a patient with actual favorable outcome. (D) SHAP explanation for a patient with actual unfavorable outcome. Arrow length encodes predictor influence magnitude on prediction.
3.6 External validation and clinical application
External validation was conducted using an independent external Hongze District People’s Hospital validation cohort comprising 324 stroke patients receiving IVT, with unfavorable outcomes documented in 74 cases (22.8%). It demonstrated robust generalizability, achieving an AUC of 0.797 (95% CI: 0.737–0.858; Figure 6A), sensitivity of 0.730, and specificity of 0.752. To facilitate clinical implementation, a nomogram was constructed based on the final logistic regression coefficients (Figure 6B), enabling clinicians to estimate individualized probabilities of an unfavorable 3-month outcome for patients receiving IVT for AIS.

Figure 6. (A) ROC curve for predicting unfavorable outcome in external validation cohort. (B) Nomogram for clinical risk prediction: Point-based scoring system. To estimate probability: (1) Locate predictor value on corresponding axis; (2) Draw vertical line to Points axis; (3) Sum points; (4) Project total to probability axis.
4 Discussion
This study has developed and validated a machine learning-based predictive model for 3-month functional outcomes following IVT in AIS patients. Using LASSO regression for feature selection followed by multivariate logistic regression, we identified five key predictors from 17 candidate variables (51): NLR, baseline NIHSS, ASPECTS, atrial fibrillation, and blood glucose. This parsimonious set of readily available clinical, radiological, and inflammatory biomarkers provides a practical foundation for prognostic assessment. We comprehensively evaluated eight machine learning models: Logistic Regression (LR), Random Forest (RF), Extreme Gradient Boosting (XGBoost), Multilayer Perceptron (MLP), Support Vector Machine (SVM), Light Gradient Boosting Machine (LightGBM), Decision Tree (DT), and K-Nearest Neighbors (KNN) (52). Based on the comprehensive performance evaluation across all validation metrics, logistic regression emerged as the most stable and reliable model, consistently demonstrating balanced performance without signs of overfitting. In contrast to complex tree-based ensemble methods (e.g., XGBoost, LightGBM) and other non-linear algorithms, which exhibited enhanced in-sample metrics (e.g., XGBoost training AUC = 0.878) yet notable declines in external validation (AUC = 0.791), logistic regression preserved strong discriminative capability on the training set (AUC = 0.792) and validation set (AUC = 0.787) with only marginal deterioration. A critical advantage of LR was its exceptional calibration precision, reflected by a Brier score of 0.140, indicating close agreement between forecast probabilities and actual event rates. Moreover, it yielded the greatest and most consistent net clinical benefit throughout the spectrum of relevant probability thresholds as evidenced by decision curve analysis. This stability likely reflects the intrinsic alignment between LR’s linear decision boundary and the predominantly additive relationships among our predictors, whereas unnecessary nonlinear complexity impaired generalization in other models.
The reliability and stability of our final logistic regression model were rigorously interrogated through a comprehensive validation framework. Internally, the model showed consistent performance with minimal degradation from cross-validation (mean AUC = 0.788) to the held-out test set (AUC = 0.777), while calibration metrics revealed excellent agreement between predicted probabilities and observed outcomes (Brier score = 0.144). Externally, the model maintained robust discrimination (AUC = 0.797) in an independent validation cohort (n = 324) with consistent sensitivity (0.730) and specificity (0.752). Furthermore, biological plausibility was confirmed through SHAP analysis, which demonstrated consistent risk directionality across cohorts (higher NLR/NIHSS/glucose increasing risk, higher ASPECTS reducing risk), thereby reinforcing model stability.
When compared to existing prognostic models, our approach demonstrates three distinct advantages. First, it exhibits superior generalizability, maintaining robust discrimination (AUC > 0.777) across both internal validation and independent external cohorts—a critical advancement for real-world implementation often lacking in single-center models. Second, it offers unmatched clinical practicality through its parsimonious nature, utilizing only five clinically routinely available variables without requiring specialized tests, complex computations, or additional costs. Third, it provides innovative biological interpretability through SHAP analysis, which not only quantifies feature contributions but also reveals clinically actionable interactions between predictors, enabling personalized risk assessment beyond conventional scoring systems.
SHAP interpretability analysis affirmed the central role of our five predictors. The analysis indicated that higher baseline NIHSS scores indicate heightened initial neurological deficits (53), and higher blood glucose synergistically amplify adverse outcome risks, consistent with cerebrovascular pathogenesis (54). Several mechanisms underlying stress-induced hyperglycemia pathology may explain this: Hepatic overproduction and insulin resistance further impair rt-PA fibrinolysis, compromises blood–brain barrier (BBB) integrity, and aggravate cerebral edema (55, 56). Conversely, diminished ASPECTS, signifying extensive early parenchymal injury (57, 58), and pre-existing atrial fibrillation emerged as independent radiological and comorbid risk factors. Mechanistically, atrial fibrillation promotes cardioembolic clot resistance to lysis during thrombolysis, worsening ischemic injury (59, 60). Crucially, NLR ranked as a principal effector, outperforming conventional biomarkers (61, 62). Substantial evidence confirms that inflammatory mediators critically influence cerebral ischemia pathogenesis (63–65). Following stroke onset, neutrophils constitute the earliest responding immune cells that amplify tissue damage through chemokine release and matrix metalloproteinase-9 (MMP-9) overexpression (26, 61), exacerbating BBB disruption and contributing to hemorrhagic transformation (66). Notably, neutrophil extracellular traps (NETs) demonstrate elevated plasma levels that correlate with poor prognoses (26). Supporting the importance of neurological severity, several studies indicate that baseline NIHSS scores correlate with 3-month functional recovery after IVT (67, 68). Additionally, clinical metrics like NLR independently predict hemorrhagic complications and mortality (61, 62). The use of LR, reinforced by SHAP interpretation, enhances clinical trust and facilitates integration into decision-making workflows.
However, our study also has several limitations. First, biomarkers were measured at a single timepoint, lacking temporal dynamics. Second, despite external validation, model development relied on retrospective data from predominantly single-center cohorts; prospective multi-center validation remains essential. Third, incorporating additional potentially relevant factors such as detailed imaging biomarkers, genetic markers, or more detailed inflammatory profiling might improve predictive accuracy. Fourth, although inflammatory biomarker selection prioritized NLR based on existing evidence, future research is needed to directly compare a broader panel of indices (such as PLR, LMR, SII, PIV, and SIRI). Fifth, a complete-case analysis approach was used for handling missing data. Although comparative analysis demonstrated no systematic differences between patients excluded for missing data and the final cohort, mitigating immediate selection bias concerns, future prospective studies should employ advanced techniques such as multiple imputation to further enhance robustness. Our model demonstrates utility in facilitating identifying for AIS patients after IVT at elevated risk of unfavorable 90-day prognosis.
This research establishes a clinically feasible model using five routinely available predictors for post-IVT adverse outcomes in AIS. Logistic Regression demonstrated superior overall performance compared to seven other machine learning algorithms, achieving optimal balance between discrimination, calibration, and clinical utility. SHAP analysis quantified individual predictor contributions, confirming NLR as the most influential risk determinant. Our validated model provides a stratification tool to identify high-risk patients, enabling personalized interventions to mitigate poor 3-month outcomes.
Data availability statement
The original contributions presented in the study are included in the article/Supplementary material, further inquiries can be directed to the corresponding authors.
Ethics statement
The studies involving humans were approved by the Research Ethics Committee at the Affiliated Hospital of Xuzhou Medical University (Approval number: XYFY2025-KL044-01). The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent from the patients/participants or patients/participants' legal guardian/next of kin was not required to participate in this study in accordance with the national legislation and the institutional requirements.
Author contributions
FB: Formal analysis, Methodology, Project administration, Visualization, Data curation, Conceptualization, Validation, Software, Writing – original draft, Investigation, Resources, Writing – review & editing. RC: Investigation, Writing – review & editing, Software, Writing – original draft, Data curation, Validation, Methodology. WZ: Investigation, Visualization, Writing – review & editing, Methodology, Validation, Data curation, Writing – original draft. XT: Investigation, Writing – original draft, Writing – review & editing, Visualization, Validation, Methodology, Data curation. GC: Software, Conceptualization, Writing – review & editing, Resources, Supervision, Data curation, Writing – original draft, Visualization, Project administration, Validation, Formal analysis, Methodology. XY: Writing – original draft, Writing – review & editing, Data curation, Supervision, Methodology, Conceptualization, Software, Project administration, Visualization, Funding acquisition, Resources, Validation, Formal analysis.
Funding
The author(s) declare that financial support was received for the research and/or publication of this article. This study was supported by the National Natural Science Foundation of China (81671269), the Summit of Six Top Talents Program of Jiangsu Province (WSN-120), the postdoctoral program for Dr. Yang (2015M571822 and 2016T90508) and the Young Physician Reserve Talents Program of Xuzhou (to XY).
Acknowledgments
We would like to express our deep gratitude to Hongze District People’s Hospital for allowing us to carry out this study.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The authors declare that no Gen AI was used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fneur.2025.1668816/full#supplementary-material
References
1. Saini, V, Guada, L, and Yavagal, DR. Global epidemiology of stroke and access to acute ischemic stroke interventions. Neurology. (2021) 97:S6–s16. doi: 10.1212/wnl.0000000000012781
2. GBD 2021 Stroke Risk Factor Collaborators. Global, regional, and national burden of stroke and its risk factors, 1990-2021: a systematic analysis for the global burden of disease study 2021. Lancet Neurol. (2024) 23:973–1003. doi: 10.1016/s1474-4422(24)00369-7
3. Wang, YJ, Li, ZX, Gu, HQ, Zhai, Y, Jiang, Y, Zhao, XQ, et al. China stroke statistics 2019: a report from the National Center for healthcare quality Management in Neurological Diseases, China National Clinical Research Center for neurological diseases, the Chinese Stroke Association, National Center for chronic and non-communicable disease control and prevention, Chinese Center for Disease Control and Prevention and institute for global neuroscience and stroke collaborations. Stroke Vasc Neurol. (2020) 5:211–39. doi: 10.1136/svn-2020-000457
4. Feigin, VL, and Owolabi, MO. Pragmatic solutions to reduce the global burden of stroke: a world stroke organization-lancet neurology commission. Lancet Neurol. (2023) 22:1160–206. doi: 10.1016/s1474-4422(23)00277-6
5. Nguyen, TN, Abdalkader, M, Fischer, U, Qiu, Z, Nagel, S, Chen, HS, et al. Endovascular management of acute stroke. Lancet. (2024) 404:1265–78. doi: 10.1016/s0140-6736(24)01410-7
6. Mendelson, SJ, and Prabhakaran, S. Diagnosis and management of transient ischemic attack and acute ischemic stroke: a review. JAMA. (2021) 325:1088–98. doi: 10.1001/jama.2020.26867
7. Feigin, VL, Norrving, B, and Mensah, GA. Global burden of stroke. Circ Res. (2017) 120:439–48. doi: 10.1161/circresaha.116.308413
8. Donaldson, J, Winders, J, Alamri, Y, Knight, D, and Wu, TY. The changing landscape of intravenous thrombolysis for acute Ischaemic stroke. J Clin Med. (2024) 13:5826. doi: 10.3390/jcm13195826
9. Katsanos, AH, and Tsivgoulis, G. Is intravenous thrombolysis still necessary in patients who undergo mechanical Thrombectomy? Curr Opin Neurol. (2019) 32:3–12. doi: 10.1097/wco.0000000000000633
10. Rinkel, LA, Ospel, JM, Kappelhof, M, Sehgal, A, McDonough, RV, Tymianski, M, et al. Comparing early National Institutes of Health stroke scale versus 90-day modified Rankin scale outcomes in acute ischemic stroke trials: a systematic review and analysis. J Am Heart Assoc. (2025) 14:e040304. doi: 10.1161/jaha.124.040304.
11. Suzuki, K, Matsumaru, Y, Takeuchi, M, Morimoto, M, Kanazawa, R, Takayama, Y, et al. Effect of mechanical Thrombectomy without vs with intravenous thrombolysis on functional outcome among patients with acute ischemic stroke: the Skip randomized clinical trial. JAMA. (2021) 325:244–53. doi: 10.1001/jama.2020.23522
12. Slot, KB, Berge, E, Dorman, P, Lewis, S, Dennis, M, and Sandercock, P. Impact of functional status at six months on long term survival in patients with ischaemic stroke: prospective cohort studies. BMJ. (2008) 336:376–9. doi: 10.1136/bmj.39456.688333.BE
13. Zhou, Z, Yin, X, Niu, Q, Liang, S, Mu, C, and Zhang, Y. Risk factors and a nomogram for predicting intracranial hemorrhage in stroke patients undergoing thrombolysis. Neuropsychiatr Dis Treat. (2020) 16:1189–97. doi: 10.2147/ndt.S250648
14. Yeo, LLL, Chien, SC, Lin, JR, Liow, CW, Lee, JD, Peng, TI, et al. Derivation and validation of a scoring system for intravenous tissue plasminogen activator use in Asian patients. J Stroke Cerebrovasc Dis. (2017) 26:1695–703. doi: 10.1016/j.jstrokecerebrovasdis.2017.03.033
15. Wu, Y, Chen, H, Liu, X, Cai, X, Kong, Y, Wang, H, et al. A new nomogram for individualized prediction of the probability of hemorrhagic transformation after intravenous thrombolysis for ischemic stroke patients. BMC Neurol. (2020) 20:426. doi: 10.1186/s12883-020-02002-w
16. Xia, Y, Li, L, Liu, P, Zhai, T, and Shi, Y. Machine learning prediction model for functional prognosis of acute ischemic stroke based on MRI Radiomics of white matter Hyperintensities. BMC Med Imaging. (2025) 25:91. doi: 10.1186/s12880-025-01632-1
17. Matzinger, P. Tolerance, danger, and the extended family. Annu Rev Immunol. (1994) 12:991–1045. doi: 10.1146/annurev.iy.12.040194.005015
18. Bonaventura, A, Liberale, L, Vecchié, A, Casula, M, Carbone, F, Dallegri, F, et al. Update on inflammatory biomarkers and treatments in ischemic stroke. Int J Mol Sci. (2016) 17:967. doi: 10.3390/ijms17121967
19. Macrez, R, Ali, C, Toutirais, O, Le Mauff, B, Defer, G, Dirnagl, U, et al. Stroke and the immune system: from pathophysiology to new therapeutic strategies. Lancet Neurol. (2011) 10:471–80. doi: 10.1016/s1474-4422(11)70066-7
20. Jickling, GC, Ander, BP, Stamova, B, Zhan, X, Liu, D, Rothstein, L, et al. Rna in blood is altered prior to hemorrhagic transformation in ischemic stroke. Ann Neurol. (2013) 74:232–40. doi: 10.1002/ana.23883
21. Song, SY, Zhao, XX, Rajah, G, Hua, C, Kang, RJ, Han, YP, et al. Clinical significance of baseline neutrophil-to-lymphocyte ratio in patients with ischemic stroke or hemorrhagic stroke: an updated Meta-analysis. Front Neurol. (2019) 10:1032. doi: 10.3389/fneur.2019.01032
22. Zera, KA, and Buckwalter, MS. The local and peripheral immune responses to stroke: implications for therapeutic development. Neurotherapeutics. (2020) 17:414–35. doi: 10.1007/s13311-020-00844-3
23. Berchtold, D, Priller, J, Meisel, C, and Meisel, A. Interaction of microglia with infiltrating immune cells in the different phases of stroke. Brain Pathol. (2020) 30:1208–18. doi: 10.1111/bpa.12911
24. Peña-Martínez, C, Durán-Laforet, V, García-Culebras, A, Ostos, F, Hernández-Jiménez, M, Bravo-Ferrer, I, et al. Pharmacological modulation of neutrophil extracellular traps reverses thrombotic stroke Tpa (tissue-type plasminogen activator) resistance. Stroke. (2019) 50:3228–37. doi: 10.1161/strokeaha.119.026848
25. Kang, L, Yu, H, Yang, X, Zhu, Y, Bai, X, Wang, R, et al. Neutrophil extracellular traps released by neutrophils impair revascularization and vascular remodeling after stroke. Nat Commun. (2020) 11:2488. doi: 10.1038/s41467-020-16191-y
26. Denorme, F, Portier, I, Rustad, JL, Cody, MJ, de Araujo, CV, Hoki, C, et al. Neutrophil extracellular traps regulate ischemic stroke brain injury. J Clin Invest. (2022) 132:225. doi: 10.1172/jci154225
27. Endres, M, Moro, MA, Nolte, CH, Dames, C, Buckwalter, MS, and Meisel, A. Immune pathways in etiology, acute phase, and chronic sequelae of ischemic stroke. Circ Res. (2022) 130:1167–86. doi: 10.1161/circresaha.121.319994
28. Zhu, B, Pan, Y, Jing, J, Meng, X, Zhao, X, Liu, L, et al. Neutrophil counts, neutrophil ratio, and new stroke in minor ischemic stroke or Tia. Neurology. (2018) 90:e1870–8. doi: 10.1212/wnl.0000000000005554
29. Ducroux, C, Di Meglio, L, Loyau, S, Delbosc, S, Boisseau, W, Deschildre, C, et al. Thrombus neutrophil extracellular traps content impair Tpa-induced thrombolysis in acute ischemic stroke. Stroke. (2018) 49:754–7. doi: 10.1161/strokeaha.117.019896
30. Meisel, C, Schwab, JM, Prass, K, Meisel, A, and Dirnagl, U. Central nervous system injury-induced immune deficiency syndrome. Nat Rev Neurosci. (2005) 6:775–86. doi: 10.1038/nrn1765
31. Dirnagl, U, Klehmet, J, Braun, JS, Harms, H, Meisel, C, Ziemssen, T, et al. Stroke-induced immunodepression: experimental evidence and clinical relevance. Stroke. (2007) 38:770–3. doi: 10.1161/01.STR.0000251441.89665.bc
32. Liesz, A, Rüger, H, Purrucker, J, Zorn, M, Dalpke, A, Möhlenbruch, M, et al. Stress mediators and immune dysfunction in patients with acute cerebrovascular diseases. PLoS One. (2013) 8:e74839. doi: 10.1371/journal.pone.0074839
33. Gong, P, Liu, Y, Gong, Y, Chen, G, Zhang, X, Wang, S, et al. The Association of Neutrophil to lymphocyte ratio, platelet to lymphocyte ratio, and lymphocyte to monocyte ratio with post-thrombolysis early neurological outcomes in patients with acute ischemic stroke. J Neuroinflammation. (2021) 18:51. doi: 10.1186/s12974-021-02090-6
34. Wang, C, Zhou, M, Kang, T, You, S, Cao, Y, Kong, W, et al. The prognostic value of combined uric acid and neutrophil-to-lymphocyte ratio in acute ischemic stroke patients treated with intravenous thrombolysis. BMC Neurol. (2024) 24:183. doi: 10.1186/s12883-024-03628-w
35. Wang, C, Zhang, Q, Ji, M, Mang, J, and Xu, Z. Prognostic value of the neutrophil-to-lymphocyte ratio in acute ischemic stroke patients treated with intravenous thrombolysis: a systematic review and Meta-analysis. BMC Neurol. (2021) 21:191. doi: 10.1186/s12883-021-02222-8
36. Heo, J, Yoon, JG, Park, H, Kim, YD, Nam, HS, and Heo, JH. Machine learning-based model for prediction of outcomes in acute stroke. Stroke. (2019) 50:1263–5. doi: 10.1161/strokeaha.118.024293
37. Lee, EJ, Kim, YH, Kim, N, and Kang, DW. Deep into the brain: artificial intelligence in stroke imaging. J Stroke. (2017) 19:277–85. doi: 10.5853/jos.2017.02054
38. McEligot, AJ, Poynor, V, Sharma, R, and Panangadan, A. Logistic lasso regression for dietary intakes and breast cancer. Nutrients. (2020) 12:652. doi: 10.3390/nu12092652
39. Daneshvar, A, and Mousa, G. Regression shrinkage and selection via least quantile shrinkage and selection operator. PLoS One. (2023) 18:e0266267. doi: 10.1371/journal.pone.0266267
40. Ye, Q, Zhai, F, Chao, B, Cao, L, Xu, Y, Zhang, P, et al. Rates of intravenous thrombolysis and endovascular therapy for acute Ischaemic stroke in China between 2019 and 2020. Lancet Reg Health West Pac. (2022) 21:100406. doi: 10.1016/j.lanwpc.2022.100406
41. Liang, W, Liang, H, Ou, L, Chen, B, Chen, A, Li, C, et al. Development and validation of a clinical risk score to predict the occurrence of critical illness in hospitalized patients with Covid-19. JAMA Intern Med. (2020) 180:1081–9. doi: 10.1001/jamainternmed.2020.2033
42. Wei, S, Zhang, H, Li, H, Li, C, Shen, Z, Yin, Y, et al. Establishment and validation of predictive model of Ards in critically ill patients. J Transl Med. (2025) 23:64. doi: 10.1186/s12967-024-06054-1
43. Ponce-Bobadilla, AV, Schmitt, V, Maier, CS, Mensing, S, and Stodtmann, S. Practical guide to Shap analysis: explaining supervised machine learning model predictions in drug development. Clin Transl Sci. (2024) 17:e70056. doi: 10.1111/cts.70056
44. Demeestere, J, Scheldeman, L, Cornelissen, SA, Heye, S, Wouters, A, Dupont, P, et al. Alberta stroke program early ct score versus computed tomographic perfusion to predict functional outcome after successful reperfusion in acute ischemic stroke. Stroke. (2018) 49:2361–7. doi: 10.1161/STROKEAHA.118.021961
45. Banks, JL, and Marotta, CA. Outcomes validity and reliability of the modified Rankin scale: implications for stroke clinical trials: a literature review and synthesis. Stroke. (2007) 38:1091–6. doi: 10.1161/01.STR.0000258355.23810.c6
46. Tibshirani, R. Regression shrinkage and selection via the lasso. J R Stat Soc B Stat Methodol. (2018) 58:267–88. doi: 10.1111/j.2517-6161.1996.tb02080.x
47. Kang, J, Choi, YJ, Kim, IK, Lee, HS, Kim, H, Baik, SH, et al. Lasso-based machine learning algorithm for prediction of lymph node metastasis in T1 colorectal Cancer. Cancer Res Treat. (2021) 53:773–83. doi: 10.4143/crt.2020.974
48. Lundberg, SM, Erion, G, Chen, H, DeGrave, A, Prutkin, JM, Nair, B, et al. From local explanations to global understanding with explainable Ai for trees. Nat Mach Intell. (2020) 2:56–67. doi: 10.1038/s42256-019-0138-9
49. Sauerbrei, W, Royston, P, and Binder, H. Selection of important variables and determination of functional form for continuous predictors in multivariable model building. Stat Med. (2007) 26:5512–28. doi: 10.1002/sim.3148
50. Pourhoseingholi, MA, Baghestani, AR, and Vahedi, M. How to control confounding effects by statistical analysis. Gastroenterol Hepatol Bed Bench. (2012) 5:79–83. doi: 10.22037/ghfbb.v5i2.246
51. Ou, Q, Jin, W, Lin, L, Lin, D, Chen, K, and Quan, H. Lasso-based machine learning algorithm to predict the incidence of diabetes in different stages. Aging Male. (2023) 26:2205510. doi: 10.1080/13685538.2023.2205510
52. Beam, AL, and Kohane, IS. Big data and machine learning in health care. JAMA. (2018) 319:1317–8. doi: 10.1001/jama.2017.18391
53. Heitsch, L, Ibanez, L, Carrera, C, Binkley, MM, Strbian, D, Tatlisumak, T, et al. Early neurological change after ischemic stroke is associated with 90-day outcome. Stroke. (2021) 52:132–41. doi: 10.1161/strokeaha.119.028687
54. Chen, G, Ren, J, Huang, H, Shen, J, Yang, C, Hu, J, et al. Admission random blood glucose, fasting blood glucose, stress hyperglycemia ratio, and functional outcomes in patients with acute ischemic stroke treated with intravenous thrombolysis. Front Aging Neurosci. (2022) 14:782282. doi: 10.3389/fnagi.2022.782282
55. Dietrich, WD, Alonso, O, and Busto, R. Moderate hyperglycemia worsens acute blood-brain barrier injury after forebrain ischemia in rats. Stroke. (1993) 24:111–6. doi: 10.1161/01.str.24.1.111
56. Wang, Y, Jiang, G, Zhang, J, Wang, J, You, W, and Zhu, J. Blood glucose level affects prognosis of patients who received intravenous thrombolysis after acute ischemic stroke? A meta-analysis. Front Endocrinol. (2023) 14:1120779. doi: 10.3389/fendo.2023.1120779
57. Deng, PP, Wu, N, Chen, XJ, Chen, FL, Xu, HS, and Bao, GS. Nihss-the Alberta stroke program early Ct score mismatch in guiding thrombolysis in patients with acute ischemic stroke. J Neurol. (2022) 269:1515–21. doi: 10.1007/s00415-021-10704-5
58. Huang, L, Liu, Q, Lu, X, Liu, S, Cao, C, Wang, Z, et al. Impact of Encephalomalacia and white matter Hyperintensities on Aspects in patients with acute ischemic stroke: comparison of automated and radiologist-derived scores. AJR Am J Roentgenol. (2022) 218:878–87. doi: 10.2214/ajr.21.26819
59. Uchino, K. Anticoagulation conundrum in acute ischemic stroke with atrial fibrillation. N Engl J Med. (2023) 388:2479–80. doi: 10.1056/NEJMe2304801
60. Sharma, M, and Srivastava, A. In acute ischemic stroke with Af, early Doac initiation was noninferior to delayed initiation for ischemic and hemorrhagic outcomes at 90 D. Ann Intern Med. (2025) 178:Jc16. doi: 10.7326/annals-24-03879-jc
61. Iadecola, C, and Anrather, J. The immunology of stroke: from mechanisms to translation. Nat Med. (2011) 17:796–808. doi: 10.1038/nm.2399
62. Buck, BH, Liebeskind, DS, Saver, JL, Bang, OY, Yun, SW, Starkman, S, et al. Early neutrophilia is associated with volume of ischemic tissue in acute stroke. Stroke. (2008) 39:355–60. doi: 10.1161/strokeaha.107.490128.
63. Parikh, NS, Merkler, AE, and Iadecola, C. Inflammation, autoimmunity, infection, and stroke: epidemiology and lessons from therapeutic intervention. Stroke. (2020) 51:711–8. doi: 10.1161/strokeaha.119.024157
64. Shi, K, Zou, M, Jia, DM, Shi, S, Yang, X, Liu, Q, et al. Tpa mobilizes immune cells that exacerbate hemorrhagic transformation in stroke. Circ Res. (2021) 128:62–75. doi: 10.1161/circresaha.120.317596
65. Gong, P, Xie, Y, Jiang, T, Liu, Y, Wang, M, Sun, H, et al. Neutrophil-lymphocyte ratio predicts post-thrombolysis early neurological deterioration in acute ischemic stroke patients. Brain Behav. (2019) 9:e01426. doi: 10.1002/brb3.1426
66. Weisenburger-Lile, D, Dong, Y, Yger, M, Weisenburger, G, Polara, GF, Chaigneau, T, et al. Harmful neutrophil subsets in patients with ischemic stroke: association with disease severity. Neurol Neuroimmunol Neuroinflamm. (2019) 6:e571. doi: 10.1212/nxi.0000000000000571
67. Sato, S, Toyoda, K, Uehara, T, Toratani, N, Yokota, C, Moriwaki, H, et al. Baseline Nih stroke scale score predicting outcome in anterior and posterior circulation strokes. Neurology. (2008) 70:2371–7. doi: 10.1212/01.wnl.0000304346.14354.0b
68. Kharitonova, T, Mikulik, R, Roine, RO, Soinne, L, Ahmed, N, and Wahlgren, N. Association of early National Institutes of Health stroke scale improvement with vessel recanalization and functional outcome after intravenous thrombolysis in ischemic stroke. Stroke. (2011) 42:1638–43. doi: 10.1161/strokeaha.110.606194
Keywords: acute ischemic stroke, intravenous thrombolysis, machine learning, outcome, SHapley Additive exPlanations
Citation: Bu F, Cai R, Zhang W, Tang X, Cui G and Yang X (2025) Explainable machine learning model for predicting the outcome of acute ischemic stroke after intravenous thrombolysis. Front. Neurol. 16:1668816. doi: 10.3389/fneur.2025.1668816
Edited by:
Jean-Claude Baron, University of Cambridge, United KingdomReviewed by:
Chenghua Xu, Taizhou First People's Hospital, ChinaLiam Morris, Galway-Mayo Institute of Technology, Ireland
Copyright © 2025 Bu, Cai, Zhang, Tang, Cui and Yang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Guiyun Cui, Y3VpZ3VpeXVuMTAxN0AxMjYuY29t; Xinxin Yang, bmV1cm9sb2d5YW5nQDEyNi5jb20=
†These authors have contributed equally to this work and share first authorship
‡These authors have contributed equally to this work and share last authorship