ORIGINAL RESEARCH article
Front. Cardiovasc. Med.
Sec. Cardiovascular Genetics and Systems Medicine
Combining WGCNA and machine learning to identify mechanisms and biomarkers of hyperthyroidism and atrial fibrillation
Provisionally accepted- Department of Cardiovascular Surgery, The Affiliated Hospital of Shanxi Medical University, Shanxi Cardiovascular Hospital (Institute), Taiyuan, China
Select one of your emails
You have multiple emails registered with Frontiers:
Notify me on publication
Please enter your email address:
If you already have an account, please login
You don't have a Frontiers account ? You can register here
Background: Hyperthyroidism and atrial fibrillation (AF) are interrelated conditions with significant cardiovascular impact. While their clinical association is established, the molecular mechanisms remain unclear. Identifying shared biomarkers and pathways can advance understanding and guide therapy. Methods: The hyperthyroidism dataset GSE71956 and the AF dataset GSE115574 were obtained from the Gene Expression Omnibus (GEO) database. Differential gene analysis was performed using the "limma" package, and overlapping genes shared by both diseases were identified through weighted gene co-expression network analysis (WGCNA), followed by functional enrichment analysis. Machine learning algorithms were also applied to identify key biomarkers. To validate the predictive results, peripheral blood samples were collected for real-time quantitative polymerase chain reaction (RT-qPCR) analysis. Finally, immune infiltration analysis was conducted to evaluate immune cell changes in hyperthyroidism and AF. Results: Through differential gene screening and WGCNA, 23 overlapping genes associated with hyperthyroidism and AF were identified. Using least absolute shrinkage and selection operator (LASSO) and random forest (RF) machine learning algorithms, CXCL16 and TMEM127 were ultimately identified as key genes. The two genes demonstrated good diagnostic efficacy in the hyperthyroidism validation set GSE276271 (AUC: TMEM127, 0.636; CXCL16, 0.591) and in the AF validation set GSE2240 (AUC: TMEM127, 0.745; CXCL16, 0.720). RT – qPCR analysis demonstrated that CXCL16 and TMEM127 expression levels were significantly elevated in both the hyperthyroidism and AF groups compared to the control group, aligning with the findings from our prior bioinformatics analysis. Immune analysis revealed significant differences in two immune cell types in both hyperthyroidism and AF. Conclusion: CXCL16 and TMEM127 are promising biomarkers, offering insights into the shared pathogenesis of hyperthyroidism and AF. These findings provide a foundation for novel diagnostic and therapeutic strategies targeting these conditions.
Keywords: Hyperthyroidism, Atrial Fibrillation, weighted gene co-expression network analysis, machine learning, biomarkers
Received: 28 Aug 2025; Accepted: 10 Nov 2025.
Copyright: © 2025 Wang, Yang, Kang, Liu and Deng. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
* Correspondence: Yongzhi Deng, olympicschina@163.com
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.
