ORIGINAL RESEARCH article
Front. Neuroimaging
Sec. Brain Imaging Methods
Volume 4 - 2025 | doi: 10.3389/fnimg.2025.1573816
This article is part of the Research TopicInnovative imaging in neurological disorders: bridging engineering and medicineView all 4 articles
Benchmarking Machine Learning Models in Lesion-Symptom Mapping for Predicting Language Outcomes in Stroke Survivors
Provisionally accepted- 1Artificial Intelligence Institute, College of Engineering and Computing, University of South Carolina, Columbia, South Carolina, United States
- 2Department of Computer Science and Engineering, College of Engineering and Computing, University of South Carolina, Columbia, South Carolina, United States
- 3Institute for Mind and Brain, University of South Carolina, USA, Columbia, SC, United States
- 4Carolina Autism and Neurodevelopment Research Center, University of South Carolina, USA, Columbia, SC, United States
- 5Department of Communication Sciences and Disorders, University of South Carolina, Columbia, SC, United States
- 6Department of Psychology, Wright State University, USA, Dayton, OH, United States
- 7Department of Psychology, University of South Carolina, USA, Columbia, SC, United States
Select one of your emails
You have multiple emails registered with Frontiers:
Notify me on publication
Please enter your email address:
If you already have an account, please login
You don't have a Frontiers account ? You can register here
Several decades of research have investigated the neural connections between stroke-induced brain damage and language difficulties. Typically, lesion-symptom mapping (LSM) studies that address this connection have relied on mass univariate statistics, which do not account for multidimensional relationships between variables. Machine learning (ML) techniques, which can capture these intricate connections, offer a promising complement to LSM methods. To test this promise, we benchmarked ML models on structural and functional MRI to predict aphasia severity (N=238) and naming impairment (N=191) for a cohort of chronic-stage stroke survivors. We used nested cross-validation to examine performance along three dimensions: (1) parcellation schemes (JHU, AAL, BRO, and AICHA atlases), (2) neuroimaging modalities (resting-state functional connectivity, structural connectivity, mean diffusivity, fractional anisotropy, and lesion location) and (3) ML methods (Random Forest, Support Vector Regression, Decision Tree, K Nearest Neighbors, and Gradient Boosting). The best results were obtained by combining the JHU atlas, lesion location, and the Random Forest model. This combination yielded moderate to high correlations with the two different behavioral scores. Key regions identified included several perisylvian areas and pathways within the language network. This work complements existing LSM methods with new tools for improving the prediction of language outcomes in stroke survivors.
Keywords: Aphasia, lesion-symptom mapping, Neuroimaging, multivariate analysis, Stroke, machine learning
Received: 09 Feb 2025; Accepted: 06 May 2025.
Copyright: © 2025 Tilwani, O'Reilly, Riccardi, Shalin, Den Ouden, Fridriksson, Shinkareva, Sheth and Desai. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
* Correspondence: Deepa Tilwani, Artificial Intelligence Institute, College of Engineering and Computing, University of South Carolina, Columbia, 29208, South Carolina, United States
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.