ORIGINAL RESEARCH article
Front. Artif. Intell.
Sec. Medicine and Public Health
Volume 8 - 2025 | doi: 10.3389/frai.2025.1662859
Federated Learning for Cognitive Impairment Detection Using Speech Data
Provisionally accepted- 1Ace Alzheimer Center Barcelona - Universitat Internacional de Catalunya, Barcelona, Spain
- 2Department of Artificial Intelligence and Big Data, GMV, Madrid, Spain
- 3Networking Research Center on Neurodegenerative Diseases (CIBERNED), Instituto de Salud Carlos III, Madrid, Spain
- 4Accexible Impacto s.l., Urduliz, Spain
- 5Glenn Biggs Institute for Alzheimer’s & Neurodegenerative Diseases, University of Texas Health Science Center, San Antonio, Spain
- 6Department of Microbiology, Immunology and Molecular Genetics., Long School of Medicine. University of Texas Health Science Center, San Antonio, Spain
Select one of your emails
You have multiple emails registered with Frontiers:
Notify me on publication
Please enter your email address:
If you already have an account, please login
You don't have a Frontiers account ? You can register here
ABSTRACT Introduction: In Alzheimer's disease (AD) research, clinical, neuroimaging, genetic, and biomarker data are vital for advancing its understanding and treatment. However, privacy concerns and limited datasets complicate data sharing. Federated learning (FL) offers a solution by enabling collaborative research while preserving data privacy. Methods: This study analyzed data from patients assessed at the Memory Unit of the Ace Alzheimer Center Barcelona who completed a standardized digital speech protocol. Acoustic features extracted from these recordings were used to distinguish between cognitively unimpaired (CU) and cognitively impaired (CI) individuals. The aim was to evaluate how data heterogeneity impacted the FL model performance across three scenarios: (1) equal contributions and class ratios, (2) unequal contributions, and (3) imbalanced class ratios. In each scenario, the performance of local models trained using an MLP feed-forward neural network on institutional data was analyzed and compared to a global model created by aggregating these local models using Federated Averaging (FedAvg) and Iterative Data Aggregation (IDA). Results: The cohort included 2,239 participants: 221 CU individuals (mean age 66.8, 64.7% female) and 2,018 CI subjects, comprising 1,219 with mild cognitive impairment (mean age 74.3, 61.9% female) and 799 with mild AD dementia (mean age 80.8, 64.8% female). In scenarios 1 and 3, FL provided modest gains in accuracy and AUC. In scenario 2, FL markedly improved performance for the smaller dataset (balanced accuracy rising from 0.51 to 0.80) while preserving 0.86 accuracy in the larger dataset, highlighting scalability across heterogeneous conditions.In scenarios 1 and 3, FL application resulted in slight accuracy improvements. In scenario 2, FL enhanced model accuracy for the institution with a smaller dataset (improving from 0.61 to 0.86) while maintaining 0.86 accuracy for the institution with a larger dataset, demonstrating scalability for detecting cognitive impairment in different scenarios. Conclusion: These findings demonstrate the potential of FL to enable collaborative modeling of speech-based biomarkers for cognitive impairment detection, even under conditions of data imbalance and institutional disparity. This work highlights FL as a scalable and privacy-preserving approach for advancing digital health research in neurodegenerative diseases.
Keywords: deep learning, Alzheimer Disease, cognitive impairments, Speech Acoustics, Federated learning
Received: 09 Jul 2025; Accepted: 24 Sep 2025.
Copyright: © 2025 Blazquez-Folch, Limones Andrade, Calm, Auñón García, ALEGRET, Muñoz, Cano, Fernández, García-Gutiérrez, De Rojas, Garcia Gonzalez, Olivé, Puerta, Capdevila-Bayo, Muñoz-Morales, Bayón-Buján, Miguel Romero, Montrreal Navarro, Espinosa, Sanz-Cartagena, Rodriguez, Zaldua, Gabirondo, Cantero-Fortiz, Gurruchaga, Tárraga Mestre, Boada Rovira, Ruiz Laza, Marquié and Valero. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
* Correspondence: Sergi Valero, svalero@fundacioace.org
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.