ORIGINAL RESEARCH article
Front. Genet.
Sec. Computational Genomics
Volume 16 - 2025 | doi: 10.3389/fgene.2025.1570781
This article is part of the Research TopicAdvancements in AI for the Analysis and Interpretation of Large-scale Data by Omics TechniquesView all 6 articles
Reference-free deconvolution of complex samples based on cross-cell type differential analysis: systematical evaluations with various feature selection options
Provisionally accepted- School of Mathematical Information, Shaoxing University, Shaoxing, China
Select one of your emails
You have multiple emails registered with Frontiers:
Notify me on publication
Please enter your email address:
If you already have an account, please login
You don't have a Frontiers account ? You can register here
Genomic and epigenomic data from complex samples reflect the average level of multiple cell types. However, differences in cell compositions can introduce bias into many relevant analyses. Consequently, the accurate estimation of cell compositions has been regarded as an important initial step in the analysis of complex samples. A large number of computational methods have been developed for estimating cell compositions, however, their applications are limited due to the absence of reference or prior information. As a result, reference-free deconvolution has the potential to be widely applied, due to its flexibility. A previous study emphasized the importance of feature selection for improving estimation accuracy in reference-free deconvolution. In this paper, we systematically evaluated five feature selection options and developed an optimal feature-selection-based referencefree deconvolution method. Our proposal iteratively searches for cell-type specific features by integrating cross-cell type differential analysis between one cell type and the other cell types, as well as between two cell types and the other cell types, and performs composition estimation. Comprehensive simulation studies and analyses of seven real datasets show the excellent performance of the proposed method. The proposed method RFdecd is implemented as an R package at https://github.com/wwzhang-study/RFdecd.
Keywords: Reference-free deconvolution, Feature Selection, cross-cell type differential analysis, cell compositions, Gene Expression, DNA Methylation
Received: 04 Feb 2025; Accepted: 05 May 2025.
Copyright: © 2025 Zhang, Tian and Peng. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
* Correspondence: Weiwei Zhang, School of Mathematical Information, Shaoxing University, Shaoxing, China
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.