Machine learning analysis of population-wide plasma proteins identifies hormonal biomarkers of Parkinson's Disease

Chaudhry, Fayzan; Kim, Tae Wan; Elemento, Olivier; Betel, Doron

doi:10.3389/fnagi.2026.1730550

ORIGINAL RESEARCH article

Front. Aging Neurosci.

Sec. Parkinson’s Disease and Aging-related Movement Disorders

Machine learning analysis of population-wide plasma proteins identifies hormonal biomarkers of Parkinson's Disease

Fayzan Chaudhry ¹

Tae Wan Kim ²

Olivier Elemento ³

Doron Betel ³

1. Weill Cornell Medicine, Cornell University, New York, United States
2. Daegu Gyeongbuk Institute of Science & Technology, Daegu, Republic of Korea
3. Weill Cornell Medicine, New York, United States

Article metrics

View details

Views

The final, formatted version of the article will be published soon.

Abstract

With the number of Parkinson's patients expected to rise due to an aging population, there is an increasing need to identify new diagnostic markers. These markers should be affordable and suitable for routine use to monitor the population, help stratify patients for treatment pathways and provide new avenues for therapy. Genetic predisposition and familial forms account for only around 10% of Parkinson’s disease (PD) cases, leaving a large fraction of the population with minimal effective markers for identifying high risk individuals. The establishment of population-wide omics and longitudinal health monitoring studies provides an opportunity to apply machine learning approaches to these unbiased cohorts to identify novel PD markers. Here we present the application of three machine learning models to identify protein plasma biomarkers of PD using plasma proteomics measurements from 43,408 UK Biobank subjects as the training and test set and an additional 103 samples from Parkinson’s Progression Markers Initiative (PPMI) as external validation. We identified a group of highly predictive protein plasma markers including known markers such as DDC and CALB2 as well as new markers involved in the JAK-STAT, PI3K-AKT pathways and hormonal signaling. We further demonstrate that these features are well correlated with UPDRS severity scores and stratify these to protective and risk associated features that potentially contribute to the pathogenesis of PD.

Summary

Keywords

biomarkers, deep learning, Neurodegenarative disease, Parkinson's disease, Proteomics

Received

22 October 2025

Accepted

10 February 2026

© 2026 Chaudhry, Kim, Elemento and Betel. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Fayzan Chaudhry; Doron Betel

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

ORIGINAL RESEARCH article

Machine learning analysis of population-wide plasma proteins identifies hormonal biomarkers of Parkinson's Disease

Abstract

Summary

Outline

Share article

Article metrics