AUTHOR=Chen Zhe , Qiang Min , Hong Yang , Tian Weibo , Tang Mingbo , Liu Wei TITLE=Machine learning-based preoperative prediction of perioperative venous thromboembolism in Chinese lung cancer patients: a retrospective cohort study JOURNAL=Frontiers in Oncology VOLUME=Volume 15 - 2025 YEAR=2025 URL=https://www.frontiersin.org/journals/oncology/articles/10.3389/fonc.2025.1588817 DOI=10.3389/fonc.2025.1588817 ISSN=2234-943X ABSTRACT=BackgroundPerioperative venous thromboembolism (VTE) is a severe complication in lung cancer surgery. Traditional prediction models have limitations in handling complex clinical data, whereas machine learning (ML) offers enhanced predictive accuracy. This study aimed to develop and validate an ML-based model for preoperative VTE risk assessment.MethodsA retrospective cohort of 1,013 lung cancer patients who underwent surgery at the First Hospital of Jilin University (April 2021–December 2023) was analyzed. Preoperative clinical and laboratory data were collected, and six key predictors—age, mean corpuscular volume, mean corpuscular hemoglobin, fibrinogen, D-dimer, and albumin—were identified using univariate analysis and Lasso regression. Eight ML models, including extreme gradient boosting (XGB), random forest, logistic regression, and support vector machines, were trained and evaluated using AUC, precision-recall curves, decision curve analysis, and calibration curves.ResultsVTE occurred in 175 patients (17.3%). The XGB model demonstrated the highest predictive performance (AUC: 0.99 training, 0.66 validation; AUPRC: 0.323), with age and mean corpuscular volume identified as the most influential predictors. An online prediction tool was developed for clinical application.ConclusionThe ML-based XGB model provides a reliable preoperative risk assessment for VTE in lung cancer patients, enabling early risk stratification and personalized thromboprophylaxis.