- 1Department of Pain, Suining Central Hospital, Suining, Sichuan, China
- 2Department of Gastroenterology, West China Fourth Hospital, Sichuan University, Chengdu, Sichuan, China
- 3Department of Rehabilitation, Suining First People's Hospital, Suining, Sichuan, China
- 4Department of Geriatrics, Suining Central Hospital, Suining, Sichuan, China
Objective: To systematically review existing predictive models for futile recanalization after mechanical thrombectomy in patients with acute ischemic stroke, in order to provide a basis for treatment decision-making.
Methods: Relevant studies on predictive models of futile recanalization after mechanical thrombectomy for acute ischemic stroke were searched in PubMed, Web of Science, Embase, The Cochrane Library, CNKI, Wanfang, and VIP databases from inception to May 5, 2024. Reference lists were also manually searched as supplements. Two researchers independently performed the literature search, screening, and data extraction, and conducted risk of bias and quality assessments. Because most included studies did not provide 95% confidence intervals or standard errors of AUC values, a formal quantitative meta-analysis of model performance was not feasible. Instead, we conducted a stratified descriptive synthesis of AUC values according to modeling approach (traditional regression vs. machine learning/deep learning).
Results: Thirteen studies were included, encompassing 23 predictive models for futile recanalization. Variables used in the models mainly involved baseline clinical and imaging features. The most frequently included predictors were age, NIHSS score, baseline mRS score, and baseline Alberta Stroke Program Early CT Score (ASPECTS). The AUC of the models ranged from 0.650 to 0.981, with 11 models reporting AUC values ≥0.8, indicating high predictive performance.
Conclusion: Predictive models for futile recanalization after mechanical thrombectomy in acute ischemic stroke are still under development. While many models exhibit good discrimination, they commonly face a high risk of bias. Future research should emphasize external validation and optimization of existing models to improve their performance, reduce bias, and promote clinical implementation.
Systematic review registration: The systematic review was registered in PROSPERO under the ID CRD42022382797. https://www.crd.york.ac.uk/prospero/display_record.php?ID=CRD42022382797.
1 Introduction
Stroke is the leading cause of disability and the second leading cause of death globally, with ischemic stroke accounting for approximately 82% of all cases. Data indicate that deaths due to ischemic stroke have increased by 60.68% over the past 30 years. With the aging population trend intensifying, the burden of ischemic stroke is expected to rise further (1, 2). National and international guidelines recommend mechanical thrombectomy (MT) as an effective treatment for acute large vessel occlusion (LVO) ischemic stroke, suitable within 24 h of onset for patients with anterior circulation occlusion and salvageable brain tissue (3). Despite the widespread adoption and continuous optimization of MT, futile recanalization remains a significant clinical issue, with postoperative rates ranging from 49 to 67% (4). Futile recanalization refers to achieving mTICI grade 2b or 3 recanalization after endovascular therapy without attaining functional independence at 90 days (5).
Numerous predictive models have been developed and validated to assess the risk of futile recanalization post-MT in patients with acute ischemic stroke, such as the HIAT and THRIVE scores (6, 7). These models, like those predicting symptomatic intracranial hemorrhage (sICH) after intravenous thrombolysis (e.g., MSS, HAT, SITS-sICH, GRASPS, SPAN-100, and SEDAN), are mainly based on traditional logistic regression (LR) methods. However, LR-based models are prone to issues such as multicollinearity and overfitting (5).
In recent years, with a deeper understanding of stroke pathophysiology, more factors such as patient history, laboratory parameters, and imaging characteristics have been found to be associated with futile recanalization after MT. Machine learning (ML) algorithms have shown strong utility in stroke diagnosis, treatment, and prognosis prediction, leading to the development of many new models. Nonetheless, current models vary significantly in quality, performance, and clinical applicability, and systematic reviews are lacking. Therefore, this study aims to synthesize and evaluate existing predictive models of futile recanalization through systematic review, quality assessment, and meta-analysis to provide scientific evidence for model optimization and clinical application.
2 Methods
This systematic review was registered in PROSPERO (ID: CRD42022382797) and was conducted in accordance with the PRISMA guidelines.
2.1 Search strategy
A comprehensive search was conducted in PubMed, Web of Science, Embase, The Cochrane Library, CNKI, Wanfang, and VIP databases for studies on predictive models of futile recanalization after mechanical thrombectomy in patients with acute ischemic stroke. The search period spanned from database inception to May 5, 2024. Additional references were identified through manual screening of bibliographies. Chinese search terms included: “ischemic stroke/stroke/cerebrovascular accident/cerebral infarction/predictive model/endovascular recanalization/mechanical thrombectomy”; English search terms included: “ischemic stroke”/“brain ischemia”/“large vessel occlusion”/“endovascular thrombectomy”/“mechanical thrombectomy”/“risk prediction model.”
2.2 Inclusion and exclusion criteria
2.2.1 Inclusion criteria
(1) Study types: case-control and cohort studies; (2) Study population: patients aged ≥8 years with a diagnosis of stroke based on commonly accepted criteria and confirmed by CT or MRI; (3) Content: development and/or validation of predictive models for futile recanalization.
2.2.2 Exclusion criteria
(1) Models including non-LVO stroke patients (e.g., hemorrhagic or lacunar strokes); (2) Duplicate publications from the same cohort; (3) Studies with incomplete model construction information or lacking performance assessment; (4) Conference abstracts, reviews, letters, commentaries, editorials, and corrigenda were excluded.
2.3 Literature screening
After de-duplication in EndNote X9, two reviewers independently screened the studies based on inclusion and exclusion criteria. Title and abstract were screened initially, followed by full-text evaluation. Disagreements were resolved through discussion or consultation with a third reviewer.
2.4 Data extraction
Data were extracted by two researchers according to the CHARMS checklist (Critical Appraisal and Data Extraction for Systematic Reviews of Prediction Modeling Studies) (8). A standardized form was used to ensure consistency, capturing details such as first author, publication year, title, country, study design, sample size, data source, diagnostic methods, number of models, outcome indicators, candidate predictors, modeling methods, variable selection techniques, model performance, validation methods, model presentation, and number and names of predictors.
2.5 Risk of bias and quality assessment
Risk of bias was assessed using the Prediction Model Risk of Bias Assessment Tool (PROBAST) (9), which includes 20 items across four domains: participants, predictors, outcomes, and analysis. Two researchers independently evaluated the studies, and discrepancies were resolved by a third reviewer. The Transparent Reporting of a Multivariable Prediction Model for Individual Prognosis or Diagnosis (TRIPOD) checklist was used to assess reporting quality.
2.6 Statistical analysis
Model performance was assessed using the area under the receiver operating characteristic curve (AUC), with corresponding 95% confidence intervals (CI) extracted. Descriptive statistics were used to summarize study characteristics, model development and validation, and performance metrics.
3 Results
3.1 Literature search results
A total of 4,201 studies were initially identified. After removing duplicates, 2,365 articles remained. Following title and abstract screening and full-text review according to inclusion and exclusion criteria, 13 studies were included in the final analysis. These studies reported 23 predictive models for futile recanalization (10–22). The study selection process is illustrated in Figure 1.
Figure 1. PRISMA flow diagram of the literature screening and selection process for included studies.
3.2 Basic characteristics of included studies
The 13 studies were published between 2014 and 2023, including 2 in Chinese (20, 22) and 11 in English (10–19, 21). Seven studies were multicenter (11–15, 17, 20), and six were single-center studies (10, 16, 18, 19, 21, 22). Four studies developed four models each (11, 14, 17, 19), while the remaining nine developed a single model. Detailed characteristics of the included studies are presented in Table 1.
3.3 Model development
The sample sizes in the included studies ranged from 45 to 1,383 participants. The primary modeling approaches included logistic regression and machine learning algorithms (see Table 2). Six studies employed univariate analysis for variable selection (12, 13, 15–18, 21); three studies applied the least absolute shrinkage and selection operator (LASSO) (18, 19, 22); and five studies used machine learning-based selection methods (11, 14, 18, 19, 21). One study did not specify the method for variable selection (10). When stratified by modeling approach, regression-based models (n = 4) achieved consistent performance, with AUCs ranging from 0.78 to 0.87 (mean 0.83, median 0.84). In contrast, machine learning/deep learning models (n = 18) exhibited a wider distribution of performance (0.65–0.98), with a mean AUC of 0.81 and a median of 0.78 (Table 3).
The AUC values of the 23 models ranged from 0.650 to 0.981, with 11 models achieving AUC ≥ 0.8, indicating good predictive performance. One model was calibrated using the Brier score (13) and another using the Hosmer–Lemeshow test (15); the rest did not report calibration measures. Internal validation was conducted in 11 studies (10, 11, 13–21), while five models underwent both internal and external validation (17, 20). Among the internally validated models, two used bootstrapping (10, 15), while the others employed cross-validation techniques. Three studies presented their models using nomograms (15, 16, 22), enabling intuitive and individualized risk prediction for clinical use. A comparative summary of external validation, calibration reporting, and nomogram availability across the included models is presented in Table 4. Notably, only five models underwent external validation, calibration was reported in only two studies, and three models were presented as nomograms.
Table 4. Comparative summary of external validation, calibration reporting, and nomogram availability among included predictive models.
3.4 Predictive factors included in the models
Across the 23 predictive models, a total of 39 distinct predictors were reported. The most frequently used variables were age, NIHSS score, baseline mRS score, and baseline ASPECTS. These factors were repeatedly identified as key indicators of poor functional outcomes despite successful recanalization. The presentation formats of the models included mathematical formulas and interactive nomograms.
3.5 Risk of bias assessment
3.5.1 Participants domain
All 13 studies exhibited a low risk of bias in the participants domain overall. However, two studies were rated as having an unclear risk concerning the appropriateness of inclusion and exclusion criteria (11, 18), as their datasets were derived from multiple cohorts with potentially inconsistent enrollment standards.
3.5.2 Predictors domain
Most studies demonstrated a low risk of bias in the predictors domain. Nonetheless, in eight studies, the consistency of predictor definition and measurement across all participants was unclear (11–15, 17, 20, 22). This was primarily due to the retrospective design of the datasets, where data were not originally collected for the purpose of model development or validation, raising concerns about blinded assessment of predictors.
3.5.3 Outcomes domain
Bias in the outcomes domain was generally low. Some studies, however, lacked detailed descriptions of how outcome variables were defined and whether outcome adjudication was independent of predictor information. This raised potential concerns regarding assessment bias.
3.5.4 Analysis domain
According to best practices, development studies should include at least 20 events per predictor variable (EPV), and validation studies should enroll at least 100 participants. Five studies did not meet these criteria (12, 14–16, 21). Most studies also did not report how continuous variables were handled, and categorization may have led to information loss. Missing data were managed in six studies through deletion, single imputation, or multiple imputation; the remainder did not report any missing data handling strategies (10, 11, 13, 16, 18, 19). Six studies used univariate analysis for predictor selection (12, 13, 15–18, 21), potentially omitting relevant covariates and increasing bias risk. Detailed risk of bias assessments are shown in Figure 2.
3.6 Quality assessment of included studies
According to TRIPOD criteria, all included studies achieved a “good” rating (reporting >70% of required items). However, most studies lacked detailed reporting on sample size calculations, handling of missing data, procedures for model updating, full parameter estimates, application instructions, and updated model results.
4 Discussion
4.1 General characteristics of predictive models for futile recanalization
This systematic review comprehensively examined predictive models for futile recanalization following mechanical thrombectomy in patients with acute ischemic stroke. Overall, this research area is still in its developmental phase, with a wide temporal span among the included studies. Nevertheless, the models generally demonstrated good predictive performance. Among the 23 models constructed across 13 studies, AUC values ranged from 0.650 to 0.981. As shown in Figure 3, 20 models had AUC values ≥0.70, and 11 models achieved AUC values ≥0.80, reflecting strong discriminatory ability for identifying patients at risk of futile recanalization.
Internal validation was performed in 20 models, which is critical to reduce overfitting and improve generalizability (23). Several studies utilized nomograms for model visualization, enabling intuitive, individualized risk assessment. Nomograms have become increasingly popular in clinical prediction due to their user-friendly, graphical interface, enhancing clinical applicability. For stroke clinicians and nurses, such tools may support personalized decision-making and optimize perioperative care strategies for patients undergoing mechanical thrombectomy (24).
4.2 Limitations of existing models
While the number of predictive models for futile recanalization has increased in recent years, significant methodological limitations persist, particularly in the analysis domain. Common issues included insufficient event-per-variable ratios, inadequate handling of missing data, reliance on univariate analysis for variable selection, and limited reporting of model calibration and performance metrics.
Sample size estimation is crucial in prediction modeling, and underpowered studies can result in unreliable models. Several included studies failed to meet the recommended sample size threshold, which is typically 20 times the number of candidate predictors (25). Furthermore, only a few studies—such as those by Hu, Lin, and Van (13, 16, 18)—appropriately addressed missing data using imputation techniques. Most others either excluded incomplete cases or did not report missing data handling at all, potentially introducing selection bias and reducing model robustness.
Univariate analysis though statistically convenient, is often inadequate for reliable variable selection as it may omit clinically relevant variables that are not statistically significant in isolation (12, 13, 15–18, 26). It also increases the risk of multicollinearity and overfitting (27). Therefore, integrating multiple variable selection strategies, as demonstrated in Van’s study (18), which used univariate regression, LASSO, elastic net, and random forest-based importance ranking, is highly encouraged.
Moreover, many studies did not report AUC values or calibration metrics. Only two models reported calibration statistics. While the AUC reflects a model’s discriminative ability (28), calibration indicates the agreement between predicted probabilities and observed outcomes, which is equally critical for clinical applicability (29). According to the PROBAST assessment, all models were rated as high risk in the analysis domain. This reflects several methodological shortcomings: many studies did not achieve adequate sample sizes or events-per-variable ratios, increasing the likelihood of overfitting; variable selection was often based on univariate analysis alone, which may overlook relevant predictors; missing data were inadequately addressed; continuous variables were sometimes arbitrarily categorized; and calibration measures were rarely reported. Together, these issues explain the consistently high PROBAST risk ratings and highlight the need for stricter adherence to methodological guidelines in future studies. Future model development should adhere to PROBAST guidelines to minimize bias (9), and to TRIPOD reporting standards to ensure transparency and reproducibility (30).
4.3 Key predictors of futile recanalization
Meta-analysis identified age, NIHSS score, baseline mRS, and ASPECTS as the most consistent predictors across models. Baseline mRS has long been recognized as a powerful indicator of functional prognosis in ischemic stroke, and models incorporating this variable generally demonstrated better performance (31).
NIHSS, a standardized tool for quantifying neurological deficit, is widely used for initial stroke severity assessment. While higher NIHSS scores may suggest greater benefit from endovascular therapy, they are also associated with increased risk of futile recanalization. However, the optimal cutoff value remains undetermined (32).
Age was incorporated into 16 out of 22 models. Older patients tend to have greater comorbidity burden and poorer functional reserve, which may compromise recovery even after successful recanalization (33). Although current guidelines do not impose age limits on endovascular therapy, outcomes in patients ≥80 years remain debated, making age a critical factor in shared decision-making.
ASPECTS has also been validated as an independent predictor of futile recanalization (34). Originally proposed by Alberta researchers, ASPECTS is a semi-quantitative scoring system used to assess early ischemic changes on non-contrast CT, with higher scores indicating less infarct burden. Previous studies suggest that patients with ASPECTS ≥7 are more likely to benefit from thrombectomy.
Hilbert et al. (21) examined radiological features as predictors, but clinical utility may be limited by inter-institutional variability in imaging protocols and interpretation. Future multicenter studies with standardized imaging assessment are warranted to clarify the role of radiomics in prediction modeling.
4.4 Comparison of machine learning and logistic regression
The comparative advantages of machine learning (ML) versus traditional logistic regression (LR) in clinical prediction remain under discussion. LR relies on predefined assumptions and is well-suited for transparent (35), hypothesis-driven modeling. In contrast, ML emphasizes data-driven discovery, excels at handling high-dimensional and non-linear data, and can uncover hidden patterns to enhance prediction accuracy (36). Our stratified descriptive analysis showed that regression-based models achieved a relatively stable mean AUC of 0.83 (range 0.78–0.87), whereas machine learning/deep learning models achieved a similar mean AUC of 0.81 but with a wider range (0.65–0.98). This variability underscores the double-edged nature of ML approaches: while some models achieved excellent discrimination, others underperformed, likely due to small sample sizes and risk of overfitting.
In this review, models built with ML and deep learning techniques demonstrated generally superior or equivalent performance compared to LR models. For instance, Nishi and Van et al. (17, 18) showed that ML slightly outperformed LR in AUC, albeit with modest margins. These findings suggest ML may serve as a useful adjunct in clinical decision support, particularly when dealing with complex feature interactions.
However, most ML models were trained on relatively small datasets with limited external validation, raising concerns about overfitting and generalizability. Moreover, the interpretability of ML models remains a challenge in clinical settings. While LR provides explicit coefficients for each predictor, ML operates as a “black box,” often requiring advanced techniques such as SHAP or LIME to elucidate feature importance.
According to the “No Free Lunch” theorem proposed by Wolpert and Macready (37), no single algorithm universally outperforms others across all problems. Therefore, applying a range of modeling techniques and selecting the most appropriate approach based on data characteristics and clinical context is essential. Despite their potential, ML and DL models exhibit several important limitations. First, most were trained on relatively small datasets with limited external validation, raising concerns about overfitting and variable generalizability. Second, their interpretability remains limited: unlike regression-based models, which provide explicit coefficients for each predictor, ML models often function as “black boxes,” requiring advanced techniques such as SHAP or LIME to explain feature importance. Third, our stratified descriptive summary indicated that although the mean AUC of ML/DL models (0.81) was comparable to regression-based models (0.83), their performance was markedly more variable (0.65–0.98), suggesting unstable generalizability and the risk of overfitting in small or heterogeneous cohorts. A notable example is the Hilbert DL model, which achieved an AUC of only 0.65. This relatively poor performance may be explained by the small training sample size, as deep learning models typically require large amounts of data to extract robust feature representations. In addition, variability in imaging acquisition and preprocessing across centers may have further limited its generalizability. This case illustrates the vulnerability of ML/DL approaches to overfitting and performance instability when applied in data-limited or heterogeneous clinical settings.
4.5 Clinical utility of predictive models
The clinical applicability of prediction models for futile recanalization remains limited. Notably, none of the included studies reported decision curve analysis, which precluded a formal assessment of net clinical benefit. Nevertheless, a narrative evaluation of the existing models suggests several potential implications for practice. Models incorporating readily available clinical variables such as age, NIHSS score, and ASPECTS may assist clinicians in identifying patients at high risk of futile recanalization, thereby informing patient selection for mechanical thrombectomy and guiding perioperative management strategies. Such information may also support shared decision-making and risk communication with patients and families.
In addition, nomogram-based models (e.g., Li, Lin, Wei) offer intuitive (15, 16, 22), graphical representations of individual risk and are particularly suitable for bedside application and integration into electronic health records. These tools may enhance usability in clinical practice, enabling both physicians and nurses to make more personalized treatment and care plans. Future studies should incorporate decision curve analysis and cost-effectiveness evaluation to further establish the net clinical benefit of prediction models and to facilitate their translation into routine clinical workflows.
5 Conclusion
In this systematic review, we comprehensively evaluated 13 studies encompassing 23 predictive models aimed at identifying the risk of futile recanalization after mechanical thrombectomy in patients with acute ischemic stroke. While many models demonstrated satisfactory discriminatory performance, with 11 models reporting an AUC ≥ 0.8, several methodological limitations were observed. These included high risk of bias, insufficient sample size, inadequate handling of missing data, and a lack of external validation. Future research should focus on improving methodological rigor through adherence to PROBAST and TRIPOD guidelines, enhancing external validation across diverse populations and clinical settings, and exploring advanced modeling techniques such as interpretable machine learning. Optimized and validated models may ultimately support individualized decision-making and improve post-thrombectomy outcomes in stroke care.
Data availability statement
The datasets presented in this article are not readily available because this study is based on data extracted from previously published studies. No new patient data were collected or analyzed. Therefore, no dataset is available for sharing.
Author contributions
CC: Conceptualization, Writing – original draft. LL: Conceptualization, Writing – original draft. XL: Methodology, Writing – original draft. YT: Writing – review & editing.
Funding
The author(s) declare that no financial support was received for the research and/or publication of this article.
Acknowledgments
We express our sincere appreciation to Suining Central Hospital and West China Fourth Hospital of Sichuan University for their institutional support. Special thanks are extended to all the clinicians and data management staff involved in patient care and record-keeping, which formed the foundation of this review. We also thank our academic mentors and peer reviewers for their valuable insights and suggestions during the preparation of this manuscript.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The authors declare that no Gen AI was used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
1. Saini, V, Guada, L, and Yavagal, DR. Global epidemiology of stroke and access to acute ischemic stroke interventions. Neurology. (2021) 97:S6–S16. doi: 10.1212/WNL.0000000000012781
2. Qian, Y, Jing, G, and Huan, C. Systematic review of risk prediction models for post-stroke depression. Chin J Neuromed. (2022) 21:916–23.
3. Jadhav, AP, Desai, SM, and Jovin, TG. Indications for mechanical thrombectomy for acute ischemic stroke: current guidelines and beyond. Neurology. (2021) 97:S126–36. doi: 10.1212/WNL.0000000000012801
4. Deng, G, Chu, YH, Xiao, J, Shang, K, Zhou, LQ, Qin, C, et al. Risk factors, pathophysiologic mechanisms, and potential treatment strategies of futile recanalization after endovascular therapy in acute ischemic stroke. Aging Dis. (2023) 14:2096–112. doi: 10.14336/AD.2023.0321-1
5. Yunpeng, JI, Yongxin, LI, Shuang, HE, and Yunfeng, ZH. Multivariate analysis of futile recanalization after endovascular therapy in patients with acute anterior circulation occlusion. J Pract Clin Med. (2023) 27:77–80.
6. Hallevi, H, Barreto, AD, Liebeskind, DS, Morales, MM, Martin-Schild, SB, Abraham, AT, et al. Identifying patients at high risk for poor outcome after intra-arterial therapy for acute ischemic stroke. Stroke. (2009) 40:1780–5. doi: 10.1161/STROKEAHA.108.535146
7. Flint, AC, Cullen, SP, Faigeles, BS, and Rao, VA. Predicting long-term outcome after endovascular stroke treatment: the totaled health risks in vascular events score. AJNR Am J Neuroradiol. (2010) 31:1192–6. doi: 10.3174/ajnr.A2050
8. Himmelreich, JC, Veelers, L, Lucassen, WA, Schnabel, RB, Rienstra, M, van Weert, HC, et al. Prediction models for atrial fibrillation applicable in the community: a systematic review and meta-analysis. Europace. (2020) 22:684–94. doi: 10.1093/europace/euaa005
9. Wolff, RF, Moons, KG, Riley, RD, Whiting, PF, Westwood, M, Collins, GS, et al. PROBAST: a tool to assess the risk of Bias and applicability of prediction model studies. Ann Intern Med. (2019) 170:51–8. doi: 10.7326/M18-1376
10. Brugnara, G, Neuberger, U, Mahmutoglu, MA, Foltyn, M, Herweh, C, Nagel, S, et al. Multimodal predictive modeling of endovascular treatment outcome for acute ischemic stroke using machine-learning. Stroke. (2020) 51:3541–51. doi: 10.1161/STROKEAHA.120.030287
11. Feyen, L, Blockhaus, C, Katoh, M, Haage, P, Schaub, C, and Machine, RS. Machine learning based outcome prediction of large vessel occlusion of the anterior circulation prior to thrombectomy in patients with wake-up stroke. Interv Neuroradiol. (2022):320755377. doi: 10.1177/15910199221135695
12. Grech, R, Galvin, PL, Power, S, O'Hare, A, Looby, S, Brennan, P, et al. Outcome prediction in acute stroke patients considered for endovascular treatment: a novel tool. Interv Neuroradiol. (2014) 20:312–24. doi: 10.15274/INR-2014-10029
13. Hu, Y, Yang, T, Zhang, J, Wang, X, Cui, X, Chen, N, et al. Dynamic prediction of mechanical thrombectomy outcome for acute ischemic stroke patients using machine learning. Brain Sci. (2022) 12:938. doi: 10.3390/brainsci12070938
14. Jabal, MS, Joly, O, Kallmes, D, Harston, G, Rabinstein, A, Huynh, T, et al. Interpretable machine learning modeling for ischemic stroke outcome prediction. Front Neurol. (2022) 13:884693. doi: 10.3389/fneur.2022.884693
15. Li, X, Zou, Y, Hu, J, Li, XM, Huang, CP, Shan, YJ, et al. A NAC nomogram to predict the probability of three-month unfavorable outcome in Chinese acute ischemic stroke patients treated with mechanical thrombectomy. Int J Neurosci. (2021) 131:163–9. doi: 10.1080/00207454.2020.1733565
16. Lin, S, Lin, X, Zhang, J, Wan, M, Chen, C, Jie, Q, et al. A visualized nomogram to online predict futile recanalization after endovascular thrombectomy in basilar artery occlusion stroke. Front Neurol. (2022) 13:968037. doi: 10.3389/fneur.2022.968037
17. Nishi, H, Oishi, N, Ishii, A, Ono, I, Ogura, T, Sunohara, T, et al. Predicting clinical outcomes of large vessel occlusion before mechanical thrombectomy using machine learning. Stroke. (2019) 50:2379–88. doi: 10.1161/STROKEAHA.119.025411
18. Van Os, HJ, Ramos, LA, Hilbert, A, Van Leeuwen, M, Van Walderveen, MA, Kruyt, ND, et al. Predicting outcome of endovascular treatment for acute ischemic stroke: potential value of machine learning algorithms. Front Neurol. (2018) 9:784.
19. Zeng, W, Li, W, Huang, K, Lin, Z, Dai, H, He, Z, et al. Predicting futile recanalization, malignant cerebral edema, and cerebral herniation using intelligible ensemble machine learning following mechanical thrombectomy for acute ischemic stroke. Front Neurol. (2022) 13:982783. doi: 10.3389/fneur.2022.982783
20. Hanqi, C, Hao, Z, Xiaomin, G, et al. Prognosis prediction of mechanical thrombectomy in acute stroke based on machine learning combined with radiomic features. J Nanjing Med Univ. (2022) 42:1165–70.
21. Hilbert, A, Ramos, L A, Os, Hvan, Os, HJA, Olabarriaga, SD, Tolhuisen, ML, et al. Data-efficient deep learning of radiological image data for outcome prediction after endovascular treatment of patients with acute ischemic stroke Comput Biol Med (2019) 115:103516 doi: 10.1016/j.compbiomed.2019.103516
22. Shuiguo, W, Fan, Y, Yuehu, M, et al. Analysis of ineffective recanalization after mechanical thrombectomy in acute ischemic stroke based on DWI radiomics-clinical model. J Med Imag. (2023) 33:1945–9.
23. Guan, G, Lee, CM, Begg, S, Crombie, A, and Mnatzaganian, G. The use of early warning system scores in prehospital and emergency department settings to predict clinical deterioration: a systematic review and meta-analysis. PLoS One. (2022) 17:e265559
24. Balachandran, VP, Gonen, M, Smith, JJ, and DeMatteo, RP. Nomograms in oncology: more than meets the eye. Lancet Oncol. (2015) 16:e173–80. doi: 10.1016/S1470-2045(14)71116-7
25. Riley, RD, Ensor, J, Snell, KI, Harrell, FE, Martin, GP, Reitsma, JB, et al. Calculating the sample size required for developing a clinical prediction model. BMJ. (2020) 368:m441
26. van Horn, N, Kniep, H, Leischner, H, McDonough, R, Deb-Chatterji, M, Broocks, G, et al. Predictors of poor clinical outcome despite complete reperfusion in acute ischemic stroke patients. J Neurointerv Surg. (2021) 13:14–8. doi: 10.1136/neurintsurg-2020-015889
27. Grobman, WA, and Stamilio, DM. Methods of clinical prediction. Am J Obstet Gynecol. (2006) 194:888–94. doi: 10.1016/j.ajog.2005.09.002
28. Vickers, AJ, Cronin, AM, Elkin, EB, and Gonen, M. Extensions to decision curve analysis, a novel method for evaluating diagnostic tests, prediction models and molecular markers. BMC Med Inform Decis Mak. (2008) 8:53. doi: 10.1186/1472-6947-8-53
29. Van Calster, B, and Vickers, AJ. Calibration of risk prediction models: impact on decision-analytic performance. Med Decis Mak. (2015) 35:162–9. doi: 10.1177/0272989X14547233
30. Collins, GS, Reitsma, JB, Altman, DG, Collins, GS, Reitsma, JB, Altman, DG, et al. Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): the TRIPOD statement. BMJ. (2015) 350:g7594. doi: 10.1136/bmj.g7594
31. Erler, KS, Wu, R, DiCarlo, JA, Petrilli, MF, Gochyyev, P, Hochberg, LR, et al. Association of modified Rankin scale with recovery phenotypes in patients with upper extremity weakness after stroke. Neurology. (2022) 98:e1877–85. doi: 10.1212/WNL.0000000000200154
32. Tao, Y, Houqin, C, Jiaqin, X, et al. Study on the relationship between intracranial artery calcification and prognosis of patients with acute large vessel occlusion ischemic stroke after mechanical thrombectomy. J Pract Clin Med. (2024) 28:79–83. doi: 10.7619/jcmp.20242770
33. Guohua, W, Xiaofei, Z, Qi, W, et al. Characteristics of carotid plaques and hemorheological parameters in elderly patients with ischemic stroke and their relation to stroke subtypes and severity. J Pract Clin Med. (2024) 28:58–62.
34. Almallouhi, E, Al Kasab, S, Hubbard, Z, Bass, EC, Porto, G, Alawieh, A, et al. Outcomes of mechanical thrombectomy for patients with stroke presenting with low Alberta stroke program early computed tomography score in the early and extended window. JAMA Netw Open. (2021) 4:e2137708
35. Boulesteix, AL, and Schmid, M. Machine learning versus statistical modeling. Biom J. (2014) 56:588–93. doi: 10.1002/bimj.201300226
36. Grendas, LN, Chiapella, L, Rodante, DE, and Daray, FM. Comparison of traditional model-based statistical methods with machine learning for the prediction of suicide behaviour. J Psychiatr Res. (2021) 145:85–91.
Keywords: acute ischemic stroke, large vessel occlusion, mechanical thrombectomy, futile recanalization, predictive model, systematic review
Citation: Chen C, Liu L, Liu X and Tan Y (2025) Systematic evaluation of predictive models for futile recanalization before thrombectomy in patients with acute ischemic stroke. Front. Neurol. 16:1625236. doi: 10.3389/fneur.2025.1625236
Edited by:
Xabier Urra, Hospital Clinic of Barcelona, SpainReviewed by:
Luwen Zhu, Heilongjiang University of Chinese Medicine, ChinaJunaid Ansari, Johns Hopkins Medicine, United States
Copyright © 2025 Chen, Liu, Liu and Tan. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Ya Tan, NzUxNzYwNDY2QHFxLmNvbQ==
†These authors have contributed equally to this work
Cheng Chen1†