ORIGINAL RESEARCH article
Front. Aging Neurosci.
Sec. Parkinson’s Disease and Aging-related Movement Disorders
Machine learning analysis of population-wide plasma proteins identifies hormonal biomarkers of Parkinson's Disease
Fayzan Chaudhry 1
Tae Wan Kim 2
Olivier Elemento 3
Doron Betel 3
1. Weill Cornell Medicine, Cornell University, New York, United States
2. Daegu Gyeongbuk Institute of Science & Technology, Daegu, Republic of Korea
3. Weill Cornell Medicine, New York, United States
Select one of your emails
You have multiple emails registered with Frontiers:
Notify me on publication
Please enter your email address:
If you already have an account, please login
You don't have a Frontiers account ? You can register here
Abstract
With the number of Parkinson's patients expected to rise due to an aging population, there is an increasing need to identify new diagnostic markers. These markers should be affordable and suitable for routine use to monitor the population, help stratify patients for treatment pathways and provide new avenues for therapy. Genetic predisposition and familial forms account for only around 10% of Parkinson’s disease (PD) cases, leaving a large fraction of the population with minimal effective markers for identifying high risk individuals. The establishment of population-wide omics and longitudinal health monitoring studies provides an opportunity to apply machine learning approaches to these unbiased cohorts to identify novel PD markers. Here we present the application of three machine learning models to identify protein plasma biomarkers of PD using plasma proteomics measurements from 43,408 UK Biobank subjects as the training and test set and an additional 103 samples from Parkinson’s Progression Markers Initiative (PPMI) as external validation. We identified a group of highly predictive protein plasma markers including known markers such as DDC and CALB2 as well as new markers involved in the JAK-STAT, PI3K-AKT pathways and hormonal signaling. We further demonstrate that these features are well correlated with UPDRS severity scores and stratify these to protective and risk associated features that potentially contribute to the pathogenesis of PD.
Summary
Keywords
biomarkers, deep learning, Neurodegenarative disease, Parkinson's disease, Proteomics
Received
22 October 2025
Accepted
10 February 2026
Copyright
© 2026 Chaudhry, Kim, Elemento and Betel. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Fayzan Chaudhry; Doron Betel
Disclaimer
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.