Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Neurol.

Sec. Sleep Disorders

Volume 16 - 2025 | doi: 10.3389/fneur.2025.1599135

Exploring common circulating diagnostic biomarkers for sleep disorders and stroke based on machine learning

Provisionally accepted
  • 1First Affiliated Hospital of Henan University of Traditional Chinese Medicine, Zhengzhou, China
  • 2Department of General Surgery, The First Affiliated Hospital, Dalian Medical University, Dalian, Liaoning, China, Dalian, China
  • 3Second Affiliated Hospital of Dalian Medical University, Dalian, Liaoning Province, China

The final, formatted version of the article will be published soon.

Background and objectives: Sleep disorders (SD) and stroke have long been health concerns. Sleep disorders are known to be a risk factor for stroke, and in recent years it has also been shown that the prevalence of sleep disorders is increased in stroke patients. We inferred that there is some inevitable connection between the two. This study aims to identify common molecular biomarkers and pathways connecting SD and stroke by integrating bioinformatics and machine learning approaches. Methods: We analysed transcriptome data from the GEO dataset to identify differentially expressed genes (DEGs). Key biological processes, as well as metabolic pathways, were highlighted by GO and KGEE enrichment analyses. Co-expression modules were then identified in the SD and stroke datasets by weighted gene co-expression network analysis (WGCNA), respectively, and machine learning algorithms (RandomForest, LASSO, and XGBoost) were performed to identify ARL2 as a key diagnostic biomarker with high predictive value (AUC = 0.91). This was finally complemented by animal experiments to verify that ARL2 was upregulated in the experimental group. Results: In GO and KEGG enrichment analyses, key biological processes such as 'response to external stimuli' and 'organic metabolic processes' as well as metabolic pathways such as 'propionate metabolism' and 'oxidative phosphorylation' were significantly enriched, suggesting their potential roles in the pathogenesis of the two disorders. With WGCNA and machine-learning algorithms analyses, we found that ARL2 is an important common marker for both diseases. Discussion: This study provides insights into the common molecular mechanisms of SD and stroke, highlighting the potential of ARL2 as a diagnostic marker and therapeutic target. Unlike previous studies, we used circulating markers rather than tissue markers, improving the clinical translation in terms of non-invasive, rapid identification of patients at risk for sleep disorders. We need to further investigate the functional role of these genes and their potential in developing targeted therapies for SD and stroke patients.

Keywords: Circulating diagnostic biomarkers, Sleep Disorders, machine learning, Stroke, ARL2

Received: 24 Mar 2025; Accepted: 16 Oct 2025.

Copyright: © 2025 Liu, Yu, Wang, Cai and Zhang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Xiangzhe Liu, liuxiangzhe@163.com

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.