Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Med.

Sec. Precision Medicine

Volume 12 - 2025 | doi: 10.3389/fmed.2025.1574546

A Study on the Risk Prediction Model for Venous Thromboembolism in Orthopedic Inpatients Based on Machine Learning

Provisionally accepted
zhang  bozhang bo1Qin  YumeiQin Yumei2Jiu  LiandiJiu Liandi1Qin  ChunmingQin Chunming2Wang  JiangboWang Jiangbo2Zhao  HaiqingZhao Haiqing2*
  • 1Digital China Health Technologies Co. Ltd, Beijing, China
  • 2Nanxishan Hospital of Guangxi Zhuang Autonomous Region, Gulin, China

The final, formatted version of the article will be published soon.

Objective: To construct a venous thromboembolism (VTE) risk prediction model for orthopedic inpatients using machine learning modeling techniques, identify high-risk patients, and optimize clinical interventions. Methods: This study involved a retrospective analysis of 286 orthopedic inpatients from Nanxishan Hospital of Guangxi Zhuang Autonomous Region (The Second People's Hospital of Guangxi Zhuang Autonomous Region) from January 1, 2022 to December 31, 2022. To ensure patient information security, all data were fully anonymized before access. The collected data included basic information (such as gender, age, ethnicity, and body mass index (BMI), lifestyle factors and medical history (including smoking, alcohol use, diabetes, hypertension, and personal and family history of VTE), clinical test results (such as thrombin time, plasma D-dimer, total bilirubin, and urinary protein via dry chemistry), as well as genetic test results related to VTE risk. Feature analysis and data mining were conducted, and eight different machine learning algorithms were used to build the prediction model. The SHapley Additive exPlanation (SHAP) method was used to rank the feature importance and explain the final model. Results: Through a comprehensive evaluation and comparison of eight different machine learning models, the results clearly indicate that the XGBoost model outperforms the others across all performance metrics, achieving the highest accuracy of 0.828 and AUROC of 0.931, significantly surpassing the other models, particularly in prediction accuracy and discriminative ability. Compared to the traditional Caprini scoring model, XGBoost not only shows improvements in accuracy and specificity but also demonstrates a significant increase in Area Under the Curve (AUC), further validating its superior performance in VTE risk prediction.This model can be effectively used for early risk prediction of VTE, helping to reduce the incidence of venous thromboembolism in orthopedic patients.Given its promising results, further validation and wider application of the model in clinical settings are warranted to enhance patient outcomes and improve preventive strategies.

Keywords: Venous Thromboembolism, machine learning, Risk Assessment, Orthopedic inpatients, Clinical decision support

Received: 14 Feb 2025; Accepted: 16 Jun 2025.

Copyright: © 2025 bo, Yumei, Liandi, Chunming, Jiangbo and Haiqing. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Zhao Haiqing, Nanxishan Hospital of Guangxi Zhuang Autonomous Region, Gulin, China

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.