ORIGINAL RESEARCH article

Front. Oncol.

Sec. Gynecological Oncology

Volume 15 - 2025 | doi: 10.3389/fonc.2025.1527674

This article is part of the Research TopicPrognostic Biomarkers and Gene Signatures in Endometrial, Ovarian, and Cervical CancerView all 17 articles

A Machine Learning Model for Predicting Lymph Node Positivity in Ovarian Cancer: Development, Validation, and Clinical Application

Provisionally accepted
  • Department of Orthopedics, Fujian Maternity and Child Health Hospital, Fuzhou, Fujian Province, China

The final, formatted version of the article will be published soon.

Ovarian cancer (OC) remains a highly lethal gynecological malignancy, often diagnosed at advanced stages with a poor prognosis. Lymph node involvement is a critical prognostic factor and significantly influences treatment planning. However, accurately predicting lymph node positivity remains challenging due to the disease's heterogeneity and the limitations of traditional models in handling high-dimensional and imbalanced data.A retrospective analysis was conducted using the SEER database , including 26,844 OC patients with complete clinical information. We developed a machine learning model incorporating multiple algorithms, with XGBoost demonstrating superior performance. SMOTE was used to address class imbalance, and LASSO regression aided in selecting key predictors such as tumor size, histology, chemotherapy, and surgery. Model performance was assessed via accuracy, sensitivity, specificity, F1 score, and AUC, with external validation performed using an independent cohort from Fujian Provincial Maternity and Children's Hospital.The XGBoost model achieved an AUC of 0.98 (95% CI: 0.975-0.986) in the training set and 0.847 (95% CI: 0.823-0.871) in external validation. The model demonstrated high sensitivity and robust performance in identifying lymph node-positive cases. Tumor size ≥5 cm, histological subtype, and chemotherapy were key predictive features, with SHAP analysis identifying tumor size as the most influential factor.We present the first machine learning model specifically developed for predicting lymph node positivity in OC, validated across large, diverse cohorts. To facilitate clinical translation, we developed a free, user-friendly online calculator, which allows clinicians to quickly estimate lymph node positivity risk using patient-specific clinical parameters. This tool can be accessed at http://127.0.0.1:6818 and serves as a practical, evidence-based aid to support individualized treatment decisions and potentially improve patient outcomes. Future studies should integrate molecular data and broaden external validation to enhance generalizability.

Keywords: ovarian cancer, Lymph node positivity, machine learning, XGBoost, prognosis

Received: 13 Nov 2024; Accepted: 09 Jun 2025.

Copyright: © 2025 Guo, Chen, Xiangpeng, Hu and Wang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence:
Zhan Xiangpeng, Department of Orthopedics, Fujian Maternity and Child Health Hospital, Fuzhou, 350001, Fujian Province, China
LiPing Hu, Department of Orthopedics, Fujian Maternity and Child Health Hospital, Fuzhou, 350001, Fujian Province, China
Jinji Wang, Department of Orthopedics, Fujian Maternity and Child Health Hospital, Fuzhou, 350001, Fujian Province, China

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.