ORIGINAL RESEARCH article
Front. Neurosci.
Sec. Neuroscience Methods and Techniques
AD Diagnosis Model Based on Fusion of Heterogeneous Brain Imaging and Genomic Data
Zhihao Zhang 1,2
Ruixia Zhang 2
Wenzhong Yang 1
Ke Lu 1
Miao Wu 2
Lianghui Xu 2
1. Xinjiang University, Urumqi, China
2. Xinjiang Medical University, Urumqi, China
Select one of your emails
You have multiple emails registered with Frontiers:
Notify me on publication
Please enter your email address:
If you already have an account, please login
You don't have a Frontiers account ? You can register here
Abstract
Alzheimer's disease (AD) is a common neurodegenerative disorder in the elderly population, and early screening can effectively delay the progression of the disease. Mild cognitive impairment (MCI) occurs prior to the onset of AD; however, the accuracy of existing MCI-to-AD prediction methods remains relatively low. Additionally, small sample sizes and high feature dimensions often lead to model overfitting, highlighting the need for effective early screening approaches. To address the aforementioned issues, this study integrated non-paired multi-modal features—including clinical indicators from the ADNI database, blood biomarkers, brain region volume features extracted from MRI, and genetic biomarkers from the GEO database—and proposed a gender-corrected random matching strategy. The Random Forest algorithm was adopted to evaluate this strategy, analyze feature importance, and compare the performance of 9 machine learning algorithms based on the top 40 ranked features. The predictive performance of multi-modal data was superior to that of single-modal data, and the proposed strategy achieved favorable results in early AD screening. 16 specific genetic features (e.g., IFI27, EDF1, RAP2A, KIF5C, SERPINA3, FBXW7, IFITM1, ISG15, PSMB3, APOE4, KCNB1, PSPH, HMGN2, S100A13, IFIT3, and CALM1) and 6 brain region volume features ranked high in terms of importance. When validated using paired datasets from ADNI across the 9 algorithms, ensemble learning models demonstrated significantly stronger fitting capabilities. The non-paired multimodal fusion approach not only expands the sample size but also enhances the generalization ability and robustness of the model. This provides a theoretical basis for the application of this strategy in the field of small-sample medical research.
Summary
Keywords
Alzheimer's disease diagnosis, bioinformatics, Early Screening Diagnosis, machine learning, Multimodal Fusions
Received
06 October 2025
Accepted
16 February 2026
Copyright
© 2026 Zhang, Zhang, Yang, Lu, Wu and Xu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Zhihao Zhang; Ke Lu; Miao Wu; Lianghui Xu
Disclaimer
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.