ORIGINAL RESEARCH article

Front. Neuroimaging

Sec. Brain Imaging Methods

Volume 4 - 2025 | doi: 10.3389/fnimg.2025.1573816

This article is part of the Research TopicInnovative imaging in neurological disorders: bridging engineering and medicineView all 4 articles

Benchmarking Machine Learning Models in Lesion-Symptom Mapping for Predicting Language Outcomes in Stroke Survivors

Provisionally accepted
  • 1Artificial Intelligence Institute, College of Engineering and Computing, University of South Carolina, Columbia, South Carolina, United States
  • 2Department of Computer Science and Engineering, College of Engineering and Computing, University of South Carolina, Columbia, South Carolina, United States
  • 3Institute for Mind and Brain, University of South Carolina, USA, Columbia, SC, United States
  • 4Carolina Autism and Neurodevelopment Research Center, University of South Carolina, USA, Columbia, SC, United States
  • 5Department of Communication Sciences and Disorders, University of South Carolina, Columbia, SC, United States
  • 6Department of Psychology, Wright State University, USA, Dayton, OH, United States
  • 7Department of Psychology, University of South Carolina, USA, Columbia, SC, United States

The final, formatted version of the article will be published soon.

Several decades of research have investigated the neural connections between stroke-induced brain damage and language difficulties. Typically, lesion-symptom mapping (LSM) studies that address this connection have relied on mass univariate statistics, which do not account for multidimensional relationships between variables. Machine learning (ML) techniques, which can capture these intricate connections, offer a promising complement to LSM methods. To test this promise, we benchmarked ML models on structural and functional MRI to predict aphasia severity (N=238) and naming impairment (N=191) for a cohort of chronic-stage stroke survivors. We used nested cross-validation to examine performance along three dimensions: (1) parcellation schemes (JHU, AAL, BRO, and AICHA atlases), (2) neuroimaging modalities (resting-state functional connectivity, structural connectivity, mean diffusivity, fractional anisotropy, and lesion location) and (3) ML methods (Random Forest, Support Vector Regression, Decision Tree, K Nearest Neighbors, and Gradient Boosting). The best results were obtained by combining the JHU atlas, lesion location, and the Random Forest model. This combination yielded moderate to high correlations with the two different behavioral scores. Key regions identified included several perisylvian areas and pathways within the language network. This work complements existing LSM methods with new tools for improving the prediction of language outcomes in stroke survivors.

Keywords: Aphasia, lesion-symptom mapping, Neuroimaging, multivariate analysis, Stroke, machine learning

Received: 09 Feb 2025; Accepted: 06 May 2025.

Copyright: © 2025 Tilwani, O'Reilly, Riccardi, Shalin, Den Ouden, Fridriksson, Shinkareva, Sheth and Desai. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Deepa Tilwani, Artificial Intelligence Institute, College of Engineering and Computing, University of South Carolina, Columbia, 29208, South Carolina, United States

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.