ORIGINAL RESEARCH article
Front. Aging Neurosci.
Sec. Parkinson’s Disease and Aging-related Movement Disorders
Volume 17 - 2025 | doi: 10.3389/fnagi.2025.1688653
Interpretable Machine Learning for Cognitive Impairment Prediction in Parkinson's Disease: A Multicenter Validation Study with SHAP Analysis
Provisionally accepted- 1Henan University of Science and Technology, Luoyang, China
- 2The First Affiliated Hospital of Henan University of Science and Technology, Luoyang, China
Select one of your emails
You have multiple emails registered with Frontiers:
Notify me on publication
Please enter your email address:
If you already have an account, please login
You don't have a Frontiers account ? You can register here
This study developed an interpretable machine learning framework to detect Parkinson's disease-related cognitive impairment (PD-CI) using routine clinical data. We analyzed 1,279 participants from the Parkinson's Progression Markers Initiative (PPMI) discovery cohort and 197 patients from an independent validation cohort, with PD-CI defined by MoCA ≤26 and UPDRS-I ≥1. Twenty-one clinical features—including hematological parameters, metabolic markers, and demographics—were preprocessed using synthetic minority over-sampling. Four machine learning models were trained via nested 5-fold cross-validation, with the Random Forest algorithm achieving superior performance (AUC=0.83), outperforming CatBoost (0.82), XGBoost (0.79), and neural networks (0.66). External validation preserved 71.57% accuracy. SHAP interpretability analysis identified age, neutrophil-to-lymphocyte ratio (NLR), and serum uric acid as critical predictors, revealing synergistic risk effects between elevated inflammation markers and reduced antioxidant levels. The framework demonstrated diagnostic accuracy comparable to advanced neuroimaging while highlighting neuroinflammation and oxidative stress as mechanistic drivers. Multicenter validation confirmed robustness across ethnic populations, supporting its utility in clinical monitoring.
Keywords: Parkinson's disease, cognitive impairment, Interpretable machine learning, XGBoost, random forest
Received: 19 Aug 2025; Accepted: 20 Oct 2025.
Copyright: © 2025 Wang and Yan. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
* Correspondence:
Ziyuan Wang, 1305926535@qq.com
Junqiang Yan, yanjq@haust.edu.cn
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.