ORIGINAL RESEARCH article
Front. Neuroinform.
Volume 19 - 2025 | doi: 10.3389/fninf.2025.1568116
This article is part of the Research TopicMachine Learning Algorithms for Brain Imaging: New Frontiers in Neurodiagnostics and TreatmentView all 10 articles
Evaluating Machine Learning Pipelines for Multimodal Neuroimaging in Small Cohorts: An ALS Case Study
Provisionally accepted- 1UMR7289 Institut de Neurosciences de la Timone (INT), Marseille, France
- 2UMR7339 Centre de Résonance Magnétique Biologique et Médicale (CRMBM), Marseille, Provence-Alpes-Côte d'Azur, France
- 3APHM, Hopital de la Timone, Referral Centre for Neuromuscular Diseases and ALS, Marseille, France
Select one of your emails
You have multiple emails registered with Frontiers:
Notify me on publication
Please enter your email address:
If you already have an account, please login
You don't have a Frontiers account ? You can register here
Advancements in machine learning hold great promise for the analysis of multimodal neuroimaging data. They can help identify biomarkers and improve diagnosis for various neurological disorders. However, the application of such techniques for rare and heterogeneous diseases remains challenging due to small-cohorts available for acquiring data. Efforts are therefore commonly directed towards improving the classification models, in an effort to optimize outcomes given the limited data. In this study, we systematically evaluated the impact of various machine learning pipeline configurations, including scaling methods, feature selection, dimensionality reduction, and hyperparameter optimization. The efficacy of such components in the pipeline was evaluated on classification performance using multimodal MRI data from a cohort of 16 ALS patients and 14 healthy controls. Our findings reveal that, while certain pipeline components, such as subject-wise feature normalization, help improve classification outcomes, the overall influence of pipeline refinements on performance is modest. Feature selection and dimensionality reduction steps were found to have limited utility, and the choice of hyperparameter optimization strategies produced only marginal gains. Our results suggest that, for small-cohort studies, the emphasis should shift from extensive tuning of these pipelines to addressing data-related limitations, such as progressively expanding cohort size, integrating additional modalities, and maximizing the information extracted from existing datasets. This study provides a methodological framework to guide future research and emphasizes the need for dataset enrichment to improve clinical utility.
Keywords: Amyotrophic Lateral Sclerosis, machine learning, multimodal MRI, Small Cohort, Classification, Pipeline optimization
Received: 28 Jan 2025; Accepted: 15 May 2025.
Copyright: © 2025 Appukuttan, Grapperon, El Mendili, Dary, Guye, RANJEVA, Zaaraoui and Gilson. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
* Correspondence: Shailesh Appukuttan, UMR7289 Institut de Neurosciences de la Timone (INT), Marseille, France
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.