Your new experience awaits. Try the new design now and help us make it even better

METHODS article

Front. Bioinform.

Sec. Integrative Bioinformatics

This article is part of the Research TopicFrom codes to cells to care, transforming health care with AI – Proceedings of the 20th Annual Meeting of the MidSouth Computational Biology and Bioinformatics Society (MCBIOS)View all 4 articles

Genetic Risk Predictions Using Deep Learning Models with Summary Data

Provisionally accepted
  • Texas State University, San Marcos, United States

The final, formatted version of the article will be published soon.

Background As a driving force of the Fourth Industrial Revolution, deep learning methods have achieved significant success across various fields, including genetic and genomic studies. While individual-level genetic data is ideal for deep learning models, privacy concerns and data-sharing restrictions often limit its availability to researchers. Methods In this paper, we investigated the potential applications of deep learning models—including deep neural networks, convolutional neural networks, recurrent neural networks, and transformers— when only genetic summary data, such as linkage disequilibrium matrices, is available. The bootstrap method was used to approximate the test error. Simulation studies and real data analyses were conducted to compare the performance of deep learning methods in genetic risk prediction using individual-level genetic data versus genetic summary data. Results The test mean squared errors (MSEs) of most applied deep learning models are comparable when using individual-level data versus summary data. Conclusion Our results suggest that suitable deep learning methods could also serve as an alternative approach to predict disease related traits when only linkage disequilibrium matrices are available as input.

Keywords: Bootstrap, deep neural networks, Linkage Disequilibrium, risk prediction, Singlenucleotide polymorphisms

Received: 30 Jun 2025; Accepted: 15 Dec 2025.

Copyright: © 2025 Wang, Xiao, Cheng and Shen. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Xiaoxi Shen

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.