Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Endocrinol.

Sec. Cancer Endocrinology

This article is part of the Research TopicUnraveling the Intricate Nexus: Pancreatic Cancer in the Context of Metabolic Syndrome, Diabetes - Associated Molecular Signatures, and Endocrine Signaling CascadesView all articles

Machine Learning-Optimized Metabolic Biomarker Panel for Precision Screening of Early-Stage Pancreatic Cancer in New-Onset Diabetes

Provisionally accepted
Weiliang  JiangWeiliang Jiang1Zhiyaun  ChengZhiyaun Cheng1Rong  MuRong Mu1Haoran  SunHaoran Sun1Zihao  GuoZihao Guo1Guo  YuGuo Yu1Dongyan  WangDongyan Wang2*Lijuan  YangLijuan Yang1*
  • 1Department of Gastroenterology, Shanghai General Hospital, Shanghai, China
  • 2Shanghai Pudong New Area Gongli Hospital, Shanghai, China

The final, formatted version of the article will be published soon.

New-onset diabetes (NOD) represents a high-risk population for pancreatic ductal adenocarcinoma (PDAC); however, robust early detection tools remain elusive. To address this challenge, we conducted serum metabolomic profiling using ultra-high-performance liquid chromatography tandem mass spectrometry (UHPLC-MS/MS) in 133 NOD patients aged over 65 years: 60 with PDAC (PDAC+NOD) and 73 without (NOD). Multivariate orthogonal partial least squares discriminant analysis (OPLS-DA) and machine learning were employed to identify and optimize a diagnostic metabolic biomarker panel. Our analysis revealed 62 differentially expressed serum metabolites (P < 0.05, FDR-corrected), predominantly implicating branched-chain amino acid metabolism, primary bile acid biosynthesis, and sphingolipid signaling pathways. Notably, significant reductions in one-carbon metabolism-related metabolites (e.g., serine, glycine, homocysteine) were observed in PDAC+NOD patients. Feature selection yielded an optimized 5-metabolite panel comprising glycine, L-serine, L-methionine, L-homocysteine, and L-homocystine. This panel demonstrated high diagnostic accuracy in the validation cohort, achieving 75.0% accuracy in distinguishing PDAC+NOD from NOD patients, with an area under the curve (AUC) of 0.853 (95% CI: 0.786-0.920). These findings establish a foundational metabolic biomarker strategy for precision screening of early-stage PDAC in NOD populations. The dysregulated one-carbon metabolites provide novel mechanistic insights into PDAC pathogenesis and offer actionable targets for clinical assay development.

Keywords: Metabolic biomarkers, Pancreatic Ductal Adenocarcinoma, New-onsetdiabetes, Early detection, Metabolomics, machine learning

Received: 12 Aug 2025; Accepted: 21 Nov 2025.

Copyright: © 2025 Jiang, Cheng, Mu, Sun, Guo, Yu, Wang and Yang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence:
Dongyan Wang, wdy01635@glhospital.com
Lijuan Yang, humourlife001@163.com

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.