Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Oncol.

Sec. Gastrointestinal Cancers: Hepato Pancreatic Biliary Cancers

Volume 15 - 2025 | doi: 10.3389/fonc.2025.1683704

Machine Learning-Based Identification of Core Regulatory Genes in Hepatocellular Carcinoma: Insights from Lactylation Modification and Liver Regeneration-Related Genes

Provisionally accepted
  • 1The First College of Clinical Medical Science, China Three Gorges University, Yichang, China
  • 2Department of Hepatobiliary Surgery, Yichang Central People’s Hospital, Yichang, China
  • 3Department of Endocrinology, Yichang Central People’s Hospital, Yichang, China
  • 4Medical Technology College of Qiqihar Medical College, Qiqihar, China

The final, formatted version of the article will be published soon.

Hepatocellular carcinoma (HCC) progression shares metabolic-epigenetic features with physiological liver regeneration, yet the regulatory interplay remains poorly defined. Here, we integrate lactylation modification profiles with liver regeneration-associated transcriptomes and employ machine learning to prioritize core regulatory genes driving HCC pathogenesis. Multi-omics analysis of three murine liver regeneration datasets (GSE20426, GSE70593, GSE4528) revealed 793 differentially expressed genes (470 upregulated, 323 downregulated), with 18 overlapping lactylation-related candidates. Machine learning (LASSO regression and SVM-RFE) prioritised six core genes (Ccna2, Csrp2, Ilf2, Kif2c, Racgap1, Vars) enriched in cell cycle regulation and DNA repair pathways. Functional characterisation demonstrated their strong correlation with immune microenvironment remodelling, particularly CD8⁺ T cells and M1 macrophages. Prognostic validation in HCC cohorts revealed significant overexpression of these genes in tumours versus normal tissues (p < 0.05), with elevated Kif2c and Ccna2 predicting poor survival. Crucially, Csrp2 exhibited superior diagnostic efficacy (AUC > 0.8) compared to conventional biomarkers. Experimental validation via qPCR confirmed marked upregulation of all six genes in clinical HCC specimens (p < 0.0001). Furthermore, Western blot demonstrated significantly elevated protein levels of CCNA2, CSRP2, ILF2, KIF2C, RACGAP1, and VARS in HCC tissues compared to adjacent non-tumor controls, corroborating their tumor-specific overexpression. Previous studies have explored lactylation in HCC pathogenesis, while this work uniquely establishes lactylation as a metabolic-epigenetic nexus linking physiological regenerative pathways to oncogenesis—by leveraging liver regeneration models and machine learning-based gene prioritization—and proposes the identified gene panel as dual-purpose biomarkers for HCC diagnosis and therapeutic targeting.

Keywords: lactylation, Liver Regeneration, Hepatocellular Carcinoma, machine learning, Bioinformatics analysis

Received: 11 Aug 2025; Accepted: 14 Oct 2025.

Copyright: © 2025 Xing, Yang, Hou, Yi, Chen, Li, Wang, Fu and Hu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Mingzheng Hu, humingzheng@ctgu.edu.cn

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.