AUTHOR=Liu Jian , Xue Xinzheng , Wen Pengbo , Song Qian , Yao Jun , Ge Shuguang TITLE=Multi-fusion strategy network-guided cancer subtypes discovering based on multi-omics data JOURNAL=Frontiers in Genetics VOLUME=Volume 15 - 2024 YEAR=2024 URL=https://www.frontiersin.org/journals/genetics/articles/10.3389/fgene.2024.1466825 DOI=10.3389/fgene.2024.1466825 ISSN=1664-8021 ABSTRACT=IntroductionThe combination of next-generation sequencing technology and Cancer Genome Atlas (TCGA) data provides unprecedented opportunities for the discovery of cancer subtypes. Through comprehensive analysis and in-depth analysis of the genomic data of a large number of cancer patients, researchers can more accurately identify different cancer subtypes and reveal their molecular heterogeneity.MethodsIn this paper, we propose the SMMSN (Self-supervised Multi-fusion Strategy Network) model for the discovery of cancer subtypes. SMMSN can not only fuse multi-level data representations of single omics data by Graph Convolutional Network (GCN) and Stacked Autoencoder Network (SAE), but also achieve the organic fusion of multi- -omics data through multiple fusion strategies. In response to the problem of lack label information in multi-omics data, SMMSN propose to use dual self-supervise method to cluster cancer subtypes from the integrated data.ResultsWe conducted experiments on three labeled and five unlabeled multi-omics datasets to distinguish potential cancer subtypes. Kaplan Meier survival curves and other results showed that SMMSN can obtain cancer subtypes with significant differences.DiscussionIn the case analysis of Glioblastoma Multiforme (GBM) and Breast Invasive Carcinoma (BIC), we conducted survival time and age distribution analysis, drug response analysis, differential expression analysis, functional enrichment analysis on the predicted cancer subtypes. The research results showed that SMMSN can discover clinically meaningful cancer subtypes.