Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Neurosci., 07 August 2025

Sec. Brain Imaging Methods

Volume 19 - 2025 | https://doi.org/10.3389/fnins.2025.1643014

This article is part of the Research TopicAdvances in brain diseases: leveraging multimodal data and artificial intelligence for diagnosis, prognosis, and treatmentView all 3 articles

Development and validation of a CT-based multi-omics nomogram for predicting hospital discharge outcomes following mechanical thrombectomy

Feifan LiuFeifan Liu1Jiayi HongJiayi Hong1Yuhan ChenYuhan Chen1Huan LiuHuan Liu1Yue WangYue Wang2Lijian SuLijian Su2Sheng Hu
Sheng Hu2*Jingjing Fu
Jingjing Fu1*
  • 1Department of Neurology, The Fourth Affiliated Hospital of School of Medicine, and International School of Medicine, International Institutes of Medicine, Zhejiang University, Yiwu, China
  • 2Department of Radiology, The Fourth Affiliated Hospital of School of Medicine, and International School of Medicine, International Institutes of Medicine, Zhejiang University, Yiwu, China

Objective: This study aimed to develop a multi-omics nomogram that combines clinical parameters, radiomics, and deep transfer learning (DTL) features of hyperattenuated imaging markers (HIM) from computed tomography scans immediately following mechanical thrombectomy (MT) to predict functional outcomes at discharge.

Methods: This study enrolled 246 patients with HIM who underwent MT. Patients were randomly assigned to a training cohort (n = 197, 80%) and a validation cohort (n = 49, 20%), with an additional internal prospective test cohort (n = 57). A total of 1,834 radiomics features and 25,088 DTL features were extracted from HIM images. Feature selection was conducted using analysis of variance (ANOVA), Pearson’s correlation, principal component analysis (PCA), and least absolute shrinkage and selection operator (LASSO) regression. A support vector machine (SVM)-based nomogram integrating clinical, radiomics, and DTL features was developed to predict functional outcomes at discharge. Its performance was evaluated based on accuracy, sensitivity, specificity, receiver operating characteristic (ROC) curve and area under the curve (AUC) analysis, decision curve analysis (DCA), and the DeLong test.

Results: The nomogram achieved AUCs of 0.995 (95% CI: 0.989–1.000) in training, 0.959 (95% CI: 0.910–1.000) in validation, and 0.894 (95% CI: 0.807–0.981) in test cohorts. Our nomogram significantly outperformed clinical, radiomics, and DTL models, as well as physician assessments (senior physicians: 0.693, p = 0.001; junior physicians: 0.600, p < 0.001).

Conclusion: This multi-omics nomogram, integrating HIM-derived, clinical, radiomic, and DTL features, accurately predicts post-MT discharge outcomes, enabling early identification of high-risk patients and optimizing management to improve prognosis.

1 Introduction

The widespread adoption of mechanical thrombectomy (MT) has significantly improved vessel recanalization rates in patients with acute large vessel occlusive stroke. Nevertheless, despite these advancements, more than 50% of patients continue to experience moderate-to-severe disability, as indicated by a modified Rankin Scale (mRS) score greater than 2 at 90 days postoperatively (Morsi et al., 2024). Obtaining reliable 90-day mRS data from stroke survivors is challenging, requiring substantial medical resources and often leading to delays that may hinder timely rehabilitation. Early identification of high-risk individuals and targeted interventions are essential for optimizing rehabilitation resource allocation and improving long-term outcomes (Li et al., 2024; Taleb et al., 2023). Traditional prognostic models, such as the MT-DRAGON and ASTRAL scores, rely primarily on preoperative clinical and imaging parameters and use linear regression methods, which are insufficient for capturing the complex and dynamic biological changes after MT (Cooray et al., 2016). Moreover, many studies focus on 90-day outcome assessments while overlooking the predictive value of discharge outcomes. Clinical evidence shows a strong correlation between discharge status and 90-day prognosis, with patients who have poor discharge outcomes being highly likely to experience unfavorable long-term outcomes, underscoring the importance of early intervention (Cooray et al., 2016; ElHabr et al., 2021; Karamchandani et al., 2021; Chen et al., 2022). Therefore, there is an urgent need for a predictive tool that can identify patients at risk of poor discharge outcomes, enabling timely interventions to improve long-term prognoses.

Hyperattenuating imaging markers (HIM) have become a focal point in prognostic modeling (Dong et al., 2024). Non-contrast computed tomography (NCCT) performed immediately after mechanical thrombectomy detects HIM in 31 to 84% of cases (Lummel et al., 2014). These hyperdense findings typically indicate blood–brain barrier disruption, most often due to contrast extravasation and, less commonly, post-procedural hemorrhage (Hu et al., 2025). Several studies have identified HIM as a key predictor of increased mortality and poor outcomes following MT, highlighting its value as a prognostic marker (Chen et al., 2019; Jiang et al., 2021). Integrating HIM-based predictive models with artificial intelligence (AI) enhances the prediction of post-thrombectomy complications by analyzing complex imaging patterns and multidimensional data, improving accuracy and sensitivity beyond traditional methods (Hu et al., 2024; Song et al., 2025). This approach provides a more robust framework for clinical decision-making, translating imaging insights into actionable strategies.

Artificial intelligence, including both deep transfer learning (DTL) and radiomics, has emerged as a promising tool for predicting outcomes in stroke patients following MT, offering high precision and the potential to enhance diagnostic and therapeutic strategies. Deep learning facilitates multi-level feature extraction directly from raw imaging data, demonstrating significant predictive capabilities (Liu Y. et al., 2023). Similarly, radiomics has garnered attention for its ability to analyze imaging features and predict functional outcomes post-MT (Heo et al., 2019). While both approaches have robust strengths, their comparative performance depends on the context, with neither consistently outperforming the other (Liu W. et al., 2023). The present study aims to integrate DTL, radiomics, and clinical features, combining the strengths of multiple models to achieve more accurate predictions of discharge outcomes in stroke patients post-MT. This integrated approach seeks to optimize resource allocation, enable early personalized interventions, and improve rehabilitation planning, ultimately enhancing long-term prognosis.

2 Methods

2.1 Ethical approval of the study protocol

This study was approved by the Ethics Committee (Approval Number: K2024139). During the prospective phase, the model was deployed without any clinical interventions or changes to standard care. After the prospective period, data were retrieved from the research database, along with our model’s predictions, for downstream analysis. All clinical investigations were conducted under the principles outlined in the Declaration of Helsinki.

2.2 Patient selection and study design

We retrospectively reviewed patients diagnosed with acute ischemic stroke due to intracranial large vessel occlusion (LVO) who underwent endovascular MT between June 2016 and December 2023. The indications and contraindications for MT and thrombolysis were based on the most current guidelines available at the time of treatment. General clinical characteristics, laboratory findings, clinical presentations, and imaging data were collected. Inclusion criteria were as follows: (1) patients underwent head NCCT post-MT; (2) initial postoperative NCCT was performed within 1 h after MT; (3) HIM, defined as hyperattenuation in the brain parenchyma or subarachnoid space, were detected on the initial post-MT NCCT. Exclusion criteria included: (1) incomplete mRS score at discharge; (2) unsuccessful MT, such as guidewire failure to reach the occlusion site or angiography confirming vessel recanalization; (3) absence of HIM; (4) imaging artifacts compromising HIM assessment; (5) administration of iodinated contrast before preoperative CT. A flowchart of the study population is presented in Figure 1.

Figure 1
Flowchart depicting patient selection process from initial and internal databases for a study. Initial database: 438 patients, excluded 192 for various reasons including incomplete mRS and unsuccessful MT. Internal database: 93 patients, excluded 36. A total of 246 eligible patients from the initial database were divided into training (197) and validation (49) cohorts. An additional 57 patients from the internal database were included in the test cohort.

Figure 1. The patient selection flowchart.

2.3 Imaging

Initial postoperative non-enhanced head CT scans were acquired within 1 h post-thrombectomy using either a 64-row spiral CT scanner (Somatom® Definition AS, Siemens Healthineers, Forchheim, Germany) or a 62-row spiral scanner (Optima® CT620, GE Medical Systems, Milwaukee, WI, USA). Scanning parameters were as follows: axial mode, tube voltage of 120 kV, tube current of 250–300 mAs, coverage from the skull base to the cranial vertex, a section thickness of 5 mm, and reconstruction with a standard algorithm. Two neuroradiologists with over 10 years of experience and blinded to the clinical data independently evaluated imaging data in randomized order. Any discrepancies were resolved through consensus discussion.

The workflow and global analysis pipeline for the classification model are illustrated in Figure 2. HIM was manually delineated on each CT image, and the region of interest (ROI) encompassing the HIM was extracted for input into the predictive model. To ensure the reliability and reproducibility of radiomics features, 30 lesions were randomly selected, and intra-class correlation coefficients (ICC) were used to assess inter- and intra-observer agreement in feature extraction. Reader A performed ROI segmentation twice, with a 1-month interval, to evaluate intra-observer agreement. Reader B segmented the 30 lesions once, and the extracted radiomics features were used to assess inter-observer agreement. Features with ICC > 0.75 were deemed reliable and retained for model construction, resulting in 1,725 features with excellent agreement (ICC > 0.75). Additionally, segmentation consistency was quantified using the Dice similarity coefficient (DSC), yielding a mean DSC of 0.954, indicating high agreement between the two neuroradiologists. DTL and radiomics features were extracted separately: DTL features were derived from the largest cross-sectional ROI, while radiomics features were computed from the entire ROI volume. Feature selection was conducted sequentially using analysis of variance (ANOVA), Pearson’s correlation coefficient, and the least absolute shrinkage and selection operator (LASSO).

Figure 2
Flowchart illustrating a diagnostic framework integrating clinical history, radiomics, and deep transfer learning for predicting outcomes. It includes CT imaging analysis, feature extraction, LASSO, and SVM classification. Nomogram and model performance charts compare training, validation, and testing phases.

Figure 2. The overall framework of the proposed model. ROI, region of interest; LASSO, least absolute shrinkage and selection operator; SVM, support vector machine; Grad Cam, Gradient-weighted Class Activation Mapping; AUC, area under curve.

2.4 Data collection

In this study, 303 patients were enrolled from June 2016 to December 2023. Of these, 246 patients treated between June 2016 and January 2023 were allocated to model training and validation: 197 (80%) were randomly assigned to the training cohort, and 49 (20%) to the validation cohort. An additional 57 patients from February 2023 to December 2023 were included in the internal test cohort. Functional outcomes at discharge were evaluated using the mRS (range: 0–6), with unfavorable outcomes defined as mRS scores of 4–6, extracted from medical records by trained clinicians. Discrepancies were resolved through consultation.

By reviewing patient charts and procedure notes, we gathered the following clinical indicators and surgical details: age, sex, hyperlipidemia, hypertension, diabetes mellitus, coronary artery disease, atrial fibrillation, history of prior stroke, history of alcohol consumption or smoking, baseline National Institutes of Health Stroke Scale (NIHSS) scores, length of hospital stay, pre-stroke mRS scores, admission mRS scores, thrombolysis status, stent implantation, modified Thrombolysis in Cerebral Infarction (mTICI) scores, use of aspiration catheters, and number of thrombectomy attempts. Imaging metrics, including the Alberta Stroke Program Early CT Score (ASPECTS) for anterior and posterior circulation, subarachnoid HIM, HUmax values, and other relevant parameters, were determined through consensus between two neuroradiologists, each with over 5 years of experience and blinded to the patients’ clinical information.

2.5 Feature extraction and selection

Radiomic features were extracted from HIM on NCCT using an in-house feature analysis program based on Pyradiomics1. To eliminate any potential variations in CT images obtained using different CT scanners, NCCT images were reconstructed using a voxel size of 1 × 1 × 1 mm3 and gray-scale discretization. A total of 1,834 radiomic features were computed, covering shape, first-order statistics, gray-level co-occurrence matrix, gray-level run-length matrix, gray-level size zone matrix, neighboring gray-tone difference matrix, gray-level dependence matrix, and wavelet features.

DTL features were extracted using five models: VGG19_bn, Inception v3, ResNet50, VGG16, and ResNet152. Among these, we selected the model with the highest predictive performance for this study, based on its superior area under the curve (AUC) value. Pre-trained on large-scale datasets, the selected model was fine-tuned using the maximum cross-sectional HIM ROI. Data augmentation techniques, including RandomResizedCrop and RandomHorizontalFlip, were applied to improve generalization. Features from the ‘avgpool’ layer generated 25,088 DTL features, which were then reduced to 303 using principal component analysis (PCA). Radiomics features were extracted and evaluated using five machine learning models: SVM, KNN, Random Forest, XGBoost, and LightGBM. Among these, SVM performed best, achieving an AUC of 0.609. As a result, a clinical-radiomics-DTL nomogram was constructed using SVM. Clinical, radiomics, and DTL features were standardized to have a mean of 0 and a variance of 1. Feature selection was performed using ANOVA, Pearson’s correlation coefficient, PCA, and LASSO to enhance the models’ generalization ability and reduce overfitting. For the clinical, radiomics, and DTL models, ANOVA, Pearson’s correlation coefficient, and LASSO were applied sequentially to complete feature selection. PCA, ANOVA, Pearson’s correlation coefficient, and LASSO were used sequentially for the combined model.

2.6 Development of the clinical signature

To develop the clinical model, we first performed univariate logistic regression analysis to identify clinical variables significantly associated with the outcome. Variables with a p-value less than 0.05 were considered candidate predictors. These selected features were then used to construct the clinical model using a support vector machine (SVM) algorithm, implemented in Python (version 3.11.13). Model performance was evaluated in the training and test cohorts using receiver operating characteristic (ROC) curves and corresponding AUC values.

2.7 Model development and validation

A nomogram integrating clinical, radiomics, and DTL features was developed, alongside separate clinical, radiomics, and DTL models for comparison. Model performance was assessed using sensitivity, specificity, receiver operating characteristic (ROC) curves, and AUC. Five-fold cross-validation, combined with grid search, was employed to optimize the hyperparameters of the SVM model. The training cohort data were divided into five folds, yielding five prediction models. For each model, sensitivity, specificity, and AUC were computed. The fold with the median AUC was selected as the representative split. Feature engineering was strictly limited to the training data in each fold to prevent data leakage and ensure unbiased evaluation. Models were trained and evaluated on the training, validation, and test cohorts. The Shapley Additive Explanations (SHAP) method was used to visualize and quantify each feature’s contribution to model predictions. Calibration curves were generated to assess the agreement between predicted probabilities and observed outcomes in the test cohort. The DeLong test was applied to compare ROC curve performance across models, and decision curve analysis (DCA) was performed to evaluate the clinical utility of the predictive models.

2.8 Implementation and hardware

The network architecture was implemented using Python and the PyTorch library. The DTL model was trained with the stochastic gradient descent algorithm as the optimizer, using a learning rate of 0.001, a mini-batch size of 8, and a binary cross-entropy loss function. Batch normalization was applied after each convolution layer to accelerate convergence and reduce overfitting. The network was trained for 50 epochs.

2.9 Sensitivity analysis of nomogram performance using 90-day mRS outcomes

To assess the consistency and robustness of the nomogram in predicting longer-term outcomes, a sensitivity analysis was conducted in the test cohort using 90-day mRS scores as the outcome measure. Patients with 90-day mRS scores of 3–6 were classified as having unfavorable outcomes.

2.10 Statistical analysis and performance evaluation

Statistical analyses were performed using SPSS software (version 27.0, IBM). Continuous variables are reported as means with standard deviations, and categorical variables as frequencies and percentages. The Kruskal–Wallis test was used to compare age, hospital stay duration, baseline NIHSS score, discharge NIHSS score, admission mRS score, and ASPECTS. Other categorical variables were analyzed using the chi-square test or Fisher’s exact test, as appropriate. Feature extraction, selection, and model construction were conducted using Python (version 3.11.13). Model performance was evaluated by calculating the AUC. Calibration curves were generated to assess agreement between predicted probabilities and observed outcomes in the test cohort. The DeLong test compared performance differences between ROC curves of different models, and DCA evaluated the clinical utility of the prediction models. A p-value < 0.05 was considered statistically significant for all tests.

3 Results

3.1 Cohort and clinical characteristics

The study included a training cohort of 197 patients, a validation cohort of 49 patients, and a test cohort of 57 patients. Clinical feature selection was performed using ANOVA, Pearson correlation, and LASSO. Table 1 details the patient characteristics. Unfavorable discharge outcomes, defined as mRS 4–6, were observed in 185 of 303 patients. Significant differences among cohorts were identified in hyperlipidemia (p = 0.010), coronary artery disease (p = 0.040), thrombolysis (p = 0.005), stent type, (p = 0.001), and aspiration catheter use (p < 0.001). A nomogram incorporating clinical, radiomics, and DTL features was developed (Figure 3). To derive the DTL features, we evaluated five convolutional neural network models: VGG19_bn, Inception v3, ResNet50, VGG16, and ResNet152. Among these, VGG19_bn demonstrated the highest performance (AUC = 0.674) and was selected for this study.

Table 1
www.frontiersin.org

Table 1. Characteristics of patients in the train, validation and test cohort.

Figure 3
A chart consisting of five scales labeled from top to bottom: Points, DTL, Rad, Clinic, Total Points, and Risk. Each scale has numerical values below indicating different measurement points. The Points scale ranges from zero to one hundred. The DTL scale ranges from 0.6462 to 0.648. The Rad scale ranges from 0.15 to 0.75. The Clinic scale ranges from 0.1 to 1. The Total Points scale ranges from zero to two hundred. The Risk scale ranges from 0.01 to 0.99.

Figure 3. A nomogram integrating radiomic features, deep learning algorithms, and clinical parameters was developed to predict the risk of unfavorable discharge outcomes.

3.2 Performance of models in the training cohort

In the training cohort, the nomogram demonstrated superior performance with an AUC of 0.995 (95% CI: 0.989–1.000), a sensitivity of 0.969, and a specificity of 0.986. The clinical model achieved an AUC of 0.924 (95% CI: 0.889–0.960), while the radiomics model and DTL model yielded AUCs of 0.792 (95% CI: 0.723–0.860) and 0.675 (95% CI: 0.600–0.751), respectively (Figure 4A).

Figure 4
Three ROC curve graphs labeled A, B, and C compare model performance. Each graph shows four lines representing different models: Nomogram, DTL, Rad, and Clinic. AUC values and confidence intervals are listed in the legends. Nomogram consistently shows the highest AUC across all graphs. X-axis represents 1-Specificity, and Y-axis represents Sensitivity.

Figure 4. ROC curves for all models across the training (A), validation (B), and test (C) cohorts. ROC, receiver operating characteristic.

3.3 Performance of models in the validation cohort

In the validation cohort, the nomogram maintained high performance with an AUC of 0.959 (95% CI: 0.910–1.000), a sensitivity of 0.903, and a specificity of 0.889. The clinical model achieved an AUC of 0.737 (95% CI: 0.597–0.876), while the radiomics model and DTL model yielded AUCs of 0.609 (95% CI: 0.431–0.788) and 0.674 (95% CI: 0.517–0.831), respectively (Figure 4B).

3.4 Performance of 5-fold cross-validation

Five independent validation results were obtained, as shown in Table 2. The fold with the median AUC (0.959) was selected to represent a typical data split for model training and evaluation.

Table 2
www.frontiersin.org

Table 2. Performance metrics from 5-fold cross-validation.

3.5 Performance of models in the test cohort

In the test cohort, our nomogram achieved an AUC of 0.894 (95% CI: 0.807–0.981), outperforming the clinical model (AUC 0.792, 95% CI: 0.674–0.910), the radiomics model (AUC 0.542, 95% CI: 0.386–0.698), and the DTL model (AUC 0.533, 95% CI: 0.378–0.689) (Figure 4C). DCA further revealed that all models improved outcome predictions compared to no-model scenarios, with the nomogram providing the most significant clinical benefit (Figure 5A). To evaluate the models’ effectiveness, the DeLong test was performed (Figure 5B). The findings indicated that the AUC of the nomogram was markedly superior to that of the radiomics model (p < 0.001) and the DTL model (p < 0.001) within the testing cohort. Moreover, the nomogram displayed a trend toward enhanced performance relative to the clinical model (p = 0.079), though this difference fell short of statistical significance. To assess the agreement between predicted and observed outcomes for the nomogram, calibration curves were generated (Figure 5C).

Figure 5
Panel A shows a Decision Curve Analysis (DCA) plot comparing net benefits of models (Nomogram, DTL, Rad, Clinic) across threshold probabilities. Panel B displays a matrix of Cohort Test Delong results with color gradients indicating significance. Panel C presents calibration curves for Nomogram, DTL, RAD, and Clinic models, showing observed versus predicted probabilities, with a reference line at a forty-five degree angle.

Figure 5. The DCA curves (A), DeLong test results (B), and calibration curves (C) for all models in the test cohort. (A) DCA curves of different models in the test cohort. The y-axis represents the net benefit, and the x-axis represents the threshold probability. DCA, Decision Curve Analysis. (B) A significant difference between models was determined by the DeLong test (p < 0.05). (C). Calibration curves for models in the test cohort. The x-axis represents predicted probabilities of unfavorable outcomes, and the y-axis represents observed probabilities.

3.6 Enhancing model interpretability using SHAP values

Figure 6 presents a SHAP bar plot, ranking features by their contribution to the model’s output in descending order, and a SHAP beeswarm plot, illustrating the impact and distribution of each feature on model predictions. The magnitude of SHAP values indicates the strength of each feature’s influence on individual predictions.

Figure 6
Graph A is a horizontal bar chart showing mean absolute SHAP values for features, with

Figure 6. SHAP values for interpreting the combined model. (A) Bar plot illustrating the relative contributions of features to the model’s output. The top-ranked feature, “Baseline NIHSS” exhibits the greatest predictive power, significantly outperforming lower-ranked features. (B) Beeswarm plot depicting SHAP values for model features. Positive SHAP values indicate a higher predicted risk of unfavorable outcome, while negative values correspond to a lower risk. NIHSS, National Institutes of Health Stroke Scale; DTL, deep transfer learning; LHH, low-high-high-pass filtered image; GLSZM, gray level size zone matrix; GLCM, gray-level co-occurrence matrix; HLH, high-low–high filtered image.

3.7 Sensitivity analysis using 90-day mRS outcomes

We performed a sensitivity analysis in the test cohort using 90-day mRS scores as the outcome measure, despite one patient being lost to follow-up at 90 days. The nomogram achieved an AUC of 0.854, suggesting good discriminatory ability for predicting long-term functional disability.

3.8 Clinically actionable threshold and risk stratification

To enhance the nomogram’s clinical utility, a probability threshold of 0.760 was established by maximizing the Youden Index (0.645) in the test cohort. At this cutoff, the model achieved a sensitivity of 0.778, specificity of 0.867, positive predictive value (PPV) of 0.840, and negative predictive value (NPV) of 0.812, striking an optimal balance between false positives and false negatives. The positive likelihood ratio (PLR) of 5.85 and negative likelihood ratio (NLR) of 0.26 demonstrate robust risk stratification.

3.9 Performance comparison of nomogram and physicians

Figure 7 illustrates the ROC curves for the nomogram and physicians. The nomogram achieves an AUC of 0.894, outperforming both the senior physician (0.693) and the junior physician (0.600). In comparison to the senior physician, the nomogram demonstrates superior accuracy (83% vs. 53%), specificity (83% vs. 80%), and sensitivity (82% vs. 62%). Similarly, when compared to the junior physician, the nomogram exhibits greater accuracy (83% vs. 53%), specificity (83% vs. 64%), and sensitivity (82% vs. 56%).

Figure 7
ROC curve chart showing model performance through sensitivity versus 1-specificity. Three curves are displayed: red solid (Nomogram, AUC 0.894), orange dashed (Senior, AUC 0.693), and yellow solid (Junior, AUC 0.600). A diagonal black dashed line serves as a reference.

Figure 7. ROC curves comparing predictive performance among the nomogram, a senior physician, and a junior physician.

4 Discussion

Herein, we extracted radiomics and DTL features from HIM on post-thrombectomy NCCT, integrated them with clinical data, and developed a nomogram to predict discharge prognosis. Importantly, our nomogram achieved an AUC of 0.894 in a prospective test cohort, surpassing single-modality models and demonstrating its superior predictive performance for discharge outcomes.

We defined an unfavorable discharge outcome as an mRS score of 4–6. Several studies have previously employed the mRS at discharge as a prognostic tool for assessing early outcomes following ischemic or hemorrhagic stroke treatment, with some establishing an mRS score greater than 3 as the threshold for an unfavorable prognosis (Hallevi et al., 2009; Shi et al., 2018; Zhao et al., 2023). Among the clinical variables in our nomogram, the baseline NIHSS score was the most influential predictor of discharge prognosis following MT, consistent with prior findings (Zhang et al., 2025). Baseline NIHSS scores, which measure stroke severity, reliably predict outcomes in large vessel occlusion, with higher scores indicating greater neurological deficits and worse prognosis (Li et al., 2018). Although the nomogram demonstrated stronger predictive performance than the clinical model in the test cohort, the difference in AUC between the two was not statistically significant (p = 0.079). The relatively strong performance of the clinical model alone may be attributed to its inclusion of the most influential predictor—baseline NIHSS—which captures a substantial amount of prognostic information and thereby enhances its predictive accuracy.

Our model not only incorporates clinical indicators but also integrates deep learning and radiomics models to form a robust predictive framework. Deep learning models have demonstrated strong performance in prognostic prediction. Nishi and Dipros previously utilized the convolutional neural network (CNN) and 3D DenseNet121 deep learning models to predict functional outcomes 3 months after MT, achieving an AUC of 0.81 (Nishi et al., 2020; Diprose et al., 2025). In contrast, our study leveraged HIM features extracted from post-thrombectomy CT scans. These CT images, acquired 1 hour after the procedure, effectively capture reperfusion injury while minimizing the confounding influence of preoperative variability (Bai and Lyden, 2015). Moreover, our model demonstrates superior predictive performance compared to previous approaches. Here, we employed the VGG19_BN model, leveraging its stacked 3 × 3 convolutional kernels to deepen the network and enhance feature extraction capabilities. The batch normalization in VGG19_BN effectively reduces internal covariate shift, accelerates training, and stabilizes gradient flow, thereby improving prognostic accuracy (Han et al., 2024). Radiomics, widely applied in predicting stroke discharge outcomes, provides quantitative insights through feature extraction and has shown considerable promise in forecasting prognosis after thrombectomy. Evidence suggests that models integrating deep learning and radiomics significantly outperform standalone deep learning approaches due to their superior image feature extraction capabilities (Liu W. et al., 2023). Thus, our nomogram, which unifies clinical, deep learning, and radiomics features, establishes a robust predictive framework that enhances accuracy and supports personalized treatment strategies.

We acknowledge that the discharge mRS, while valuable for early prognostication, may not fully capture long-term functional outcomes. To explore the relationship between discharge status and 90-day prognosis, we conducted a sensitivity analysis using 90-day mRS scores as the endpoint in the test cohort. Despite the shift to a longer-term outcome measure, the model maintained robust discriminative ability (AUC = 0.854), confirming its stability and generalizability. This finding suggests a potential link between an unfavorable discharge outcome (mRS > 3) and an unfavorable 90-day prognosis (mRS > 2), indicating that clinicians could leverage this early marker to identify high-risk patients and implement targeted interventions to improve long-term outcomes.

To demonstrate the clinical utility of the nomogram, we identified a probability threshold of 0.760 that effectively stratifies patients by their risk of poor functional outcome at discharge (Figure 8). Patients with predicted probabilities above 0.760 warrant intensified neurological monitoring and early, personalized rehabilitation to mitigate secondary decline and improve recovery, while those below this threshold may receive standard post-thrombectomy care, optimizing resource allocation.

Figure 8
Axial CT brain images showing two sections labeled A and B. Section A demonstrates a subtle area of contrast extravasation. Section B shows a larger region of intense hyperdensity, indicative of marked contrast extravasation; the possibility of hemorrhage cannot be definitively excluded.

Figure 8. Representative examples illustrating the clinical application of the nomogram. (A) A 31-year-old male patient with a baseline NIHSS score of 11. The non-contrast CT image acquired immediately after thrombectomy is shown. The nomogram-predicted risk score was 0.444 (below the clinical threshold of 0.760), indicating a favorable discharge outcome, which was consistent with the actual prognosis. (B) A 93-year-old male with a baseline NIHSS score of 21. The immediate post-thrombectomy non-contrast CT is presented. The nomogram predicted a risk score of 0.964, exceeding the 0.760 threshold, suggesting a high risk of poor discharge prognosis, which aligned with the clinical outcome.

We evaluated the predictive performance of our nomogram for discharge outcomes against that of senior and junior physicians. Using their extensive experience and deep familiarity with critical indicators, senior physicians surpassed junior physicians in performance but faced challenges in handling high-dimensional data or discerning subtle patterns, areas in which the model excelled. This highlights the potential of integrating machine learning models with clinical expertise to enhance prognostic accuracy and decision-making (Rajpurkar et al., 2017). This multi-omics model effectively captures complex biological and imaging patterns by incorporating DTL-derived imaging features, traditional radiomics, and clinical variables, offering objectivity and consistency that surpasses human capability (Choi et al., 2023). The nomogram could serve as a reliable decision-support tool in the future, enhancing clinical precision by integrating real-time patient data with physician expertise, particularly benefiting junior physicians and time-sensitive scenarios.

Nonetheless, the present study has several limitations. Firstly, it was conducted at a single center using two scanners with standardized protocols, which minimizes intra-institutional variability but may limit generalizability to diverse clinical settings. Variations in scanner vendors, acquisition protocols, and patient populations across institutions could impact feature robustness and model performance. To address this, we plan prospective multicenter studies with diverse imaging sources and populations to evaluate model transferability and clinical utility in future. Secondly, the small sample size may reduce statistical power and external validity, particularly in the prospective test cohort. Thirdly, while discharge outcomes are closely related to 90-day prognosis, this association is not absolute and requires further validation. Finally, while manual lesion segmentation by experienced neuroradiologists ensures rigor, it introduces subjectivity and variability, particularly for poorly defined lesions, potentially biasing our results. To address this, future research should explore automated segmentation techniques to enhance reproducibility and reduce variability. Therefore, future multicenter studies with larger, more diverse cohorts and automated feature extraction are warranted to address these limitations and further validate our model’s clinical utility.

5 Conclusion

In conclusion, our study introduces a robust predictive model that effectively integrates radiomics, deep learning, and clinical data to predict patient outcomes following thrombectomy. The high predictive accuracy of our nomogram makes it a promising clinical decision-making tool, with the potential to enhance patient prognosis and improve personalized treatment strategies.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

The studies involving humans were approved by the Ethics Committee of the Fourth Affiliated Hospital, Zhejiang University College of Medicine. The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation in this study was provided by the participants’ legal guardians/next of kin. Written informed consent was obtained from the individual(s) for the publication of any potentially identifiable images or data included in this article.

Author contributions

FL: Writing – original draft, Methodology, Investigation, Formal analysis, Conceptualization, Writing – review & editing. JH: Data curation, Visualization, Validation, Software, Writing – review & editing. YC: Writing – review & editing, Validation, Investigation, Methodology. HL: Methodology, Validation, Investigation, Writing – review & editing. YW: Writing – review & editing, Software, Data curation. LS: Writing – review & editing, Software, Data curation. SH: Supervision, Resources, Funding acquisition, Writing – review & editing, Project administration. JF: Resources, Funding acquisition, Project administration, Writing – review & editing, Supervision.

Funding

The author(s) declare that financial support was received for the research and/or publication of this article. This work was supported by Medical Scientific Research Foundation of Zhejiang Province, China (No: 2025KY090) and the Fundamental Research Funds for the Central Universities (No: 226202400226).

Acknowledgments

Some of our experiments were carried out on the Onekey AI platform. Thank Onekey AI and its developers for help in this scientific research work.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The authors declare that no Gen AI was used in the creation of this manuscript.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Footnotes

References

Bai, J., and Lyden, P. D. (2015). Revisiting cerebral Postischemic reperfusion injury: new insights in understanding reperfusion failure, Hemorrhage, and Edema. Int. J. Stroke 10, 143–152. doi: 10.1111/ijs.12434

PubMed Abstract | Crossref Full Text | Google Scholar

Chen, W.-H., Yi, T.-Y., Wu, Y.-M., Zhang, M.-F., Lin, D.-L., and Lin, X.-H. (2019). Parenchymal hyperdensity on C-arm CT images after endovascular therapy for acute ischaemic stroke predicts a poor prognosis. Clin. Radiol. 74, 399–404. doi: 10.1016/j.crad.2019.01.009

PubMed Abstract | Crossref Full Text | Google Scholar

Chen, S.-D., You, J., Yang, X.-M., Gu, H.-Q., Huang, X.-Y., Liu, H., et al. (2022). Machine learning is an effective method to predict the 90-day prognosis of patients with transient ischemic attack and minor stroke. BMC Med. Res. Methodol. 22:195. doi: 10.1186/s12874-022-01672-z

PubMed Abstract | Crossref Full Text | Google Scholar

Choi, A., Choi, S. Y., Chung, K., Chung, H. S., Song, T., Choi, B., et al. (2023). Development of a machine learning-based clinical decision support system to predict clinical deterioration in patients visiting the emergency department. Sci. Rep. 13:8561. doi: 10.1038/s41598-023-35617-3

PubMed Abstract | Crossref Full Text | Google Scholar

Cooray, C., Mazya, M., Bottai, M., Dorado, L., Skoda, O., Toni, D., et al. (2016). External validation of the ASTRAL and DRAGON scores for prediction of functional outcome in stroke. Stroke 47, 1493–1499. doi: 10.1161/STROKEAHA.116.012802

PubMed Abstract | Crossref Full Text | Google Scholar

Diprose, J. P., Diprose, W. K., Chien, T.-Y., Wang, M. T. M., McFetridge, A., Tarr, G. P., et al. (2025). Deep learning on pre-procedural computed tomography and clinical data predicts outcome following stroke thrombectomy. J. Neurointervent. Surg. 17, 266–271. doi: 10.1136/jnis-2023-021154

PubMed Abstract | Crossref Full Text | Google Scholar

Dong, M., Chen, C., Chen, W., and An, K. (2024). A CT texture-based nomogram for predicting futile reperfusion in patients with intraparenchymal hyperdensity after endovascular thrombectomy for acute anterior circulation large vessel occlusion. Front. Neurol. 15:1327585. doi: 10.3389/fneur.2024.1327585

PubMed Abstract | Crossref Full Text | Google Scholar

ElHabr, A. K., Katz, J. M., Wang, J., Bastani, M., Martinez, G., Gribko, M., et al. (2021). Predicting 90-day modified Rankin scale score with discharge information in acute ischaemic stroke patients following treatment. BMJ Neurol. Open 3:e000177. doi: 10.1136/bmjno-2021-000177

PubMed Abstract | Crossref Full Text | Google Scholar

Hallevi, H., Barreto, A. D., Liebeskind, D. S., Morales, M. M., Martin-Schild, S. B., Abraham, A. T., et al. (2009). Identifying patients at high risk for poor outcome after intra-arterial therapy for acute ischemic stroke. Stroke 40, 1780–1785. doi: 10.1161/STROKEAHA.108.535146

PubMed Abstract | Crossref Full Text | Google Scholar

Han, B., Chang, Y., Tan, R., and Han, C. (2024). Evaluating deep learning techniques for identifying tongue features in subthreshold depression: a prospective observational study. Front. Psych. 15:1177. doi: 10.3389/fpsyt.2024.1361177

PubMed Abstract | Crossref Full Text | Google Scholar

Heo, J., Yoon, J. G., Park, H., Kim, Y. D., Nam, H. S., and Heo, J. H. (2019). Machine learning–based model for prediction of outcomes in acute stroke. Stroke 50, 1263–1265. doi: 10.1161/STROKEAHA.118.024293

PubMed Abstract | Crossref Full Text | Google Scholar

Hu, S., Hong, J., Liu, F., Wang, Z., Li, N., Wang, S., et al. (2024). An integrated nomogram combining clinical and radiomic features of hyperattenuated imaging markers to predict malignant cerebral edema following endovascular thrombectomy. Quant. Imaging Med. Surg. 14, 4936–4949. doi: 10.21037/qims-24-99

PubMed Abstract | Crossref Full Text | Google Scholar

Hu, S., Liu, J., Hong, J., Chen, Y., Wang, Z., Hu, J., et al. (2025). Machine learning, clinical-radiomics approach with HIM for hemorrhagic transformation prediction after thrombectomy and treatment. Front. Neurol. 16:1471274. doi: 10.3389/fneur.2025.1471274

PubMed Abstract | Crossref Full Text | Google Scholar

Jiang, Q., Hou, J., Ge, J., Huang, Z., Wang, H., Guo, Z., et al. (2021). Clinical significance of Hyperdense area after endovascular therapy in patients with acute ischemic stroke: a systematic review and meta-analysis. Cerebrovasc. Dis. 50, 500–509. doi: 10.1159/000515410

PubMed Abstract | Crossref Full Text | Google Scholar

Karamchandani, R. R., Strong, D., Rhoten, J. B., Prasad, T., Selig, J., Defilipp, G., et al. (2021). Age and discharge modified Rankin score are associated with 90-day functional outcome after basilar artery occlusion treated with endovascular therapy. Interv. Neuroradiol. 27, 531–538. doi: 10.1177/1591019920987040

PubMed Abstract | Crossref Full Text | Google Scholar

Li, X., He, Y., Wang, D., and Rezaei, M. J. (2024). Stroke rehabilitation: from diagnosis to therapy. Front. Neurol. 15:2729. doi: 10.3389/fneur.2024.1402729

PubMed Abstract | Crossref Full Text | Google Scholar

Li, J., Zhang, P., Wu, S., Yi, X., Wang, C., and Liu, M. (2018). Factors associated with favourable outcome in large hemispheric infarctions. BMC Neurol. 18:152. doi: 10.1186/s12883-018-1148-7

PubMed Abstract | Crossref Full Text | Google Scholar

Liu, W., Wang, W., Zhang, H., Guo, M., Xu, Y., and Liu, X. (2023). Development and validation of multi-omics Thymoma risk classification model based on transfer learning. J. Digit. Imaging 36, 2015–2024. doi: 10.1007/s10278-023-00855-4

PubMed Abstract | Crossref Full Text | Google Scholar

Liu, Y., Yu, Y., Ouyang, J., Jiang, B., Yang, G., Ostmeier, S., et al. (2023). Functional outcome prediction in acute ischemic stroke using a fused imaging and clinical deep learning model. Stroke 54, 2316–2327. doi: 10.1161/STROKEAHA.123.044072

PubMed Abstract | Crossref Full Text | Google Scholar

Lummel, N., Schulte-Altedorneburg, G., Bernau, C., Pfefferkorn, T., Patzig, M., Janssen, H., et al. (2014). Hyperattenuated intracerebral lesions after mechanical recanalization in acute stroke. Am. J. Neuroradiol. 35, 345–351. doi: 10.3174/ajnr.A3656

PubMed Abstract | Crossref Full Text | Google Scholar

Morsi, S., Ghozy, S., Elfil, M., Tolba, H., Rabenstein, A., Kadirvel, R., et al. (2024). Endovascular thrombectomy for DAWN- and DEFUSE-3 ineligible acute ischemic stroke patients: a systematic review and meta-analysis. J. Neurol. 271, 2230–2237. doi: 10.1007/s00415-024-12198-3

PubMed Abstract | Crossref Full Text | Google Scholar

Nishi, H., Oishi, N., Ishii, A., Ono, I., Ogura, T., Sunohara, T., et al. (2020). Deep learning–derived high-level neuroimaging features predict clinical outcomes for large vessel occlusion. Stroke 51, 1484–1492. doi: 10.1161/STROKEAHA.119.028101

PubMed Abstract | Crossref Full Text | Google Scholar

Rajpurkar, P., Irvin, J., Zhu, K., Yang, B., Mehta, H., Duan, T., et al. (2017). CheXNet: radiologist-level pneumonia detection on chest x-rays with deep learning. Available online at: https://arxiv.org/abs/1711.05225 (Accessed April 1, 2025).

Google Scholar

Shi, Z., Duckwiler, G. R., Jahan, R., Tateshima, S., Szeder, V., Saver, J. L., et al. (2018). Early blood-brain barrier disruption after mechanical thrombectomy in acute ischemic stroke. J. Neuroimaging 28, 283–288. doi: 10.1111/jon.12504

PubMed Abstract | Crossref Full Text | Google Scholar

Song, Y., Hong, J., Liu, F., Liu, J., Chen, Y., Li, Z., et al. (2025). Deep learning-assisted diagnosis of malignant cerebral Edema following endovascular Thrombectomy. Acad. Radiol. 32, 3237–3247. doi: 10.1016/j.acra.2025.02.021

PubMed Abstract | Crossref Full Text | Google Scholar

Taleb, S., Lee, J. J., Duncan, P., Cramer, S. C., Bahr-Hosseini, M., Su, M., et al. (2023). Essential information for neurorecovery clinical trial design: trajectory of global disability in first 90 days post-stroke in patients discharged to acute rehabilitation facilities. BMC Neurol. 23:239. doi: 10.1186/s12883-023-03251-1

PubMed Abstract | Crossref Full Text | Google Scholar

Zhang, Y., Zheng, T., Wang, H., Zhu, J., Duan, S., and Song, B. (2025). Predicting functional outcomes of endovascular Thrombectomy in acute ischemic stroke using a clinical-Radiomics nomogram. World Neurosurg. 193, 911–919. doi: 10.1016/j.wneu.2024.10.073

PubMed Abstract | Crossref Full Text | Google Scholar

Zhao, X., Zhou, B., Luo, Y., Chen, L., Zhu, L., Chang, S., et al. (2023). CT-based deep learning model for predicting hospital discharge outcome in spontaneous intracerebral hemorrhage. Eur. Radiol. 34, 4417–4426. doi: 10.1007/s00330-023-10505-6

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: prognosis, nomogram, thrombectomy, acute ischemic stroke, multi-detector CT

Citation: Liu F, Hong J, Chen Y, Liu H, Wang Y, Su L, Hu S and Fu J (2025) Development and validation of a CT-based multi-omics nomogram for predicting hospital discharge outcomes following mechanical thrombectomy. Front. Neurosci. 19:1643014. doi: 10.3389/fnins.2025.1643014

Received: 07 June 2025; Accepted: 28 July 2025;
Published: 07 August 2025.

Edited by:

Dong Zeng, Southern Medical University, China

Reviewed by:

Xinyu Zhang, Huizhou Central People’s Hospital, China
Shuai Fu, The Sixth Affiliated Hospital of Sun Yat-sen University, China

Copyright © 2025 Liu, Hong, Chen, Liu, Wang, Su, Hu and Fu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Sheng Hu, ODAxNTEzNUB6anUuZWR1LmNu; Jingjing Fu, ZnVqaW5namluZzE5ODVAemp1LmVkdS5jbg==

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.