- 1Department of Emergency, Affiliated Dongyang Hospital, Wenzhou Medical University, Dongyang, Zhejiang, China
- 2Department of Biomedical Sciences Laboratory, Affiliated Dongyang Hospital, Wenzhou Medical University, Dongyang, Zhejiang, China
- 3Department of Obstetrics, Affiliated Dongyang Hospital, Wenzhou Medical University, Dongyang, Zhejiang, China
Purpose: Bleeding during anticoagulation therapy represents a critical challenge in pulmonary embolism (PE) management, this study aimed to develop and validate a PE-specific bleeding risk prediction model.
Methods: This retrospective cohort study utilized a clinical research big data platform, including 5,632 hospitalized PE patients (January 2013–December 2024). Significant bleeding within 6 months served as the primary outcome. After excluding variables with >20% missingness, 29 predictors were analyzed. The cohort was randomly split into development (n = 3,942) and validation sets (n = 1,690). LASSO regression identified key predictors, with multivariable logistic regression constructing the final model. Performance was assessed via AUC-ROC, calibration plots, and decision curve analysis (DCA).
Results: The final model identified six predictors: prior bleeding history, renal insufficiency, red blood cell count, systolic pressure, cerebral infarction, and creatinine. The model demonstrated robust discrimination (development AUC: 0.756, 95%CI: 0.729–0.784; validation AUC: 0.729, 95%CI: 0.685–0.773) and calibration (validation slope: 0.810). DCA confirmed significant net benefit at 5–35% thresholds, with 30% as the optimal cut-off. At this threshold, the model reduced major bleeding by 42% versus standard care.
Conclusion: This novel PE-specific bleeding risk tool provides clinically actionable stratification, enabling personalized anticoagulation intensity adjustment. Implementation may reduce hemorrhage-related morbidity while optimizing resource utilization.
1 Introduction
Pulmonary embolism (PE), the third leading cause of cardiovascular mortality after stroke and myocardial infarction (1), remains a critical medical challenge. Recent advances in management—including catheter-directed thrombolysis, mechanical thrombectomy, extracorporeal membrane oxygenation (ECMO), and surgical embolectomy—have expanded therapeutic options (2). Nonetheless, hemorrhage persists as a major complication, especially following thrombolytic therapy (3). Although anticoagulation and thrombolysis are effective in reducing thrombotic burden, they concomitantly increase bleeding risk (4). Accurate prediction of hemorrhagic events is therefore essential for balancing thromboembolic protection against bleeding hazards and guiding personalized treatment strategies.
Accurate prediction of bleeding risk is essential in anticoagulated patients with PE, prompting the development of several predictive models and scores (5, 6). Among these, the PE-SARD score was specifically designed for acute PE and demonstrated a C-index of 0.654 for 30-day major bleeding in a large external validation cohort, outperforming both BACS and PE-CH models (7). The VTE-BLEED score, widely validated in venous thromboembolism, effectively identifies patients at high risk of major bleeding—including intracranial and fatal events—during anticoagulation (8), and retains predictive power over the long term (9). The IMPROVE bleeding score has proven valuable in predicting hemorrhage in high-risk populations such as patients with advanced gastrointestinal cancer (10) and hospitalized COVID-19 patients (11). Machine learning approaches have also shown promise; one model for cancer-associated thrombosis outperformed conventional CAT-BLEED scores (12), and another incorporating liver function markers with PE-SARD improved early bleeding prediction in acute PE (13).
Despite these efforts, commonly used clinical scores such as HAS-BLED and ATRIA were not originally developed or adequately validated in PE populations, leading to limited predictive accuracy in this group. There remains a pressing need to develop or validate dedicated prediction tools tailored specifically to patients with PE.
2 Methods
2.1 Study population
This retrospective study utilized data from a clinical research big data platform of Affiliated Dongyang Hospital of Wenzhou Medical University. Inclusion criteria for participants were: (1) age over 18 years; (2) discharge diagnosis of pulmonary embolism. Exclusion criteria: (1) Pregnant or lactating women; (2) Patients with incomplete medical histories or examination test results; (3) Patients with missing data of PE or lacking relevant bleeding records; (4) Individuals who died during hospitalization. We identified and included 5,632 patients hospitalized with a confirmed diagnosis of PE between January 2013 and December 2024. Based on bleeding outcome, patients were categorized into two groups: those who experienced significant bleeding (bleeding group, N = 447) and those who did not (no bleeding group, N = 5,185). The study initially collected data on 32 candidate predictor variables (indicators) potentially associated with bleeding risk. Three variables (weight, weight, BMI) were excluded prior to model development due to a high proportion (>20%) of missing values. Thus, the analysis proceeded with 29 variables. The final cohort of 5,632 patients was randomly partitioned into a training set (N = 3,942, 70%) for model training and a testing set (N = 1,690, 30%) for subsequent internal validation of the derived risk prediction models. The study protocol received ethics approval from the Ethics Committee of Affiliated Dongyang Hospital of Wenzhou Medical University (approval #2025-YX-157). Informed consent was waived for this study. Prior to conducting the analysis, all patient medical information was anonymized and de-identified.
2.2 Outcome definition
The primary outcome of this study was the occurrence of any documented clinically significant bleeding event within 6 months following the diagnosis of PE. In our study, bleeding events were identified based on the presence of any hemorrhagic diagnosis within the primary discharge diagnoses. Bleeding events included gastrointestinal bleeding, intracranial hemorrhage, urinary bleeding, oral bleeding, ophthalmic hemorrhage, and other major bleeds (14). For analysis, outcomes were defined as binary: presence of any qualifying bleeding event (positive outcome) versus absence of bleeding (negative outcome).
2.3 Candidate predictor variables
The variables extracted from our hospital’s EMRs were meticulously selected based on their established relevance in existing bleeding risk scores, supporting evidence from the literature, and clinical experience pertinent to bleeding risk in PE patients. (1) Demographics and vitals: Age, height, weight, BMI, systolic blood pressure, diastolic blood pressure. (2) Comorbidities and history: Smoking status, alcohol consumption, diabetes, hypertension, pulmonary hypertension, pulmonary infarction, history of prior bleeding, arterial thrombosis, active malignancy, myocardial infarction, cerebral infarction, renal insufficiency. (3) Treatments: Anticoagulant use, thrombolytic therapy, antiplatelet therapy. (4) Laboratory parameters (measured within 1 month prior to PE diagnosis): white blood cell count (WBC), creatinine, activated partial thromboplastin time (APTT), international normalized ratio (INR), prothrombin time (PT): highest recorded value. Platelet count (PLT), red blood cell count (RBC), hemoglobin (HGB): lowest recorded value. All comorbidities and historical conditions were recorded only if documented before the diagnosis of PE.
2.4 Data pre-processing
Data extracted from the clinical research big data platform underwent rigorous preprocessing. Variables with >20% missing values (e.g., height, weight, BMI) were excluded from analysis. For remaining missing values in candidate predictors, multiple imputation by chained equations (MICE) was employed (15, 16). We performed 20 iterations using predictive mean matching as the imputation model, with a random seed set for reproducibility. As part of the data cleaning process, outliers were identified and removed in accordance with conventional criteria for biological plausibility and statistical extremes (values beyond Q3 + 1.5 × IQR or below Q1 − 1.5 × IQR). The cohort was then randomly split in a 7:3 ratio stratified into a training set (70%) for model training and a validation set (30%) for performance evaluation.
2.5 Model building
Feature selection was performed using least absolute shrinkage and selection operator (LASSO) regression (17) with 10-fold cross-validation to identify optimal predictors while mitigating overfitting. The lambda.1se value was chosen to select the final model. Variables retained at the optimal lambda value were subsequently entered into multivariable logistic regression. Significant indicators identified in the univariate analysis were assessed for multicollinearity using variance inflation factors (VIFs), with a threshold of VIF <10 indicating no severe multicollinearity. The linearity of the relationship between continuous variables and the logit of the outcome was tested using the Box–Tidwell procedure; a significance level of p < 0.05 suggested a linear relationship was present. After confirming both the absence of multicollinearity and the linearity assumptions, independent risk factors were selected via stepwise multivariate logistic regression to construct the final nomogram (18). The stepwise backward elimination was indeed performed based on the Akaike information criterion (AIC).
2.6 Model evaluation
Model performance was comprehensively assessed across three domains: discrimination, calibration, and clinical utility. Discriminatory ability was quantified by the area under the receiver operating characteristic curve (AUC-ROC). Calibration was evaluated through calibration plots. Clinical net benefit across threshold probabilities was analyzed using decision curve analysis (DCA), with additional validation through clinical impact curves (CIC). Finally, the model’s predictive superiority was established by comparing its AUC against individual predictor variables. The complete model training and validation workflow is depicted in Figure 1.
2.7 Statistical methods
Statistical analysis and data visualization were performed using R4.4.2 software for Windows. Categorical variables are presented as n (%) and were compared using the χ2 test or Fisher’s exact test. Continuous variables are reported as mean ± standard deviation or median (interquartile range) and were compared using either Student’s t-test or the Mann–Whitney U test. Multiple imputation techniques were implemented using the “mice” package. Baseline description and difference analysis were performed with the “comparegroups” package. LASSO regression was conducted using the “glmnet” package, while multivariable logistic regression was performed using the “glm” function. Discrimination analysis was carried out using the “pROC,” “ggROC,” and “fbroc” packages. Calibration was assessed using the “rms” and “riskregression” packages. Decision curve analysis (DCA) was conducted using the “rmda” package. The nomogram was created using the “rms” package. Comparisons of multiple models for ROC analysis were conducted using the “ROCR” package. All statistical tests were two-sided, with p < 0.05 considered statistically significant.
3 Results
3.1 Study population characteristics
The study population comprised 5,632 patients with PE, divided into bleeding (n = 447) and non-bleeding (n = 5,185) cohorts. Significant baseline differences emerged between groups (Table 1). Patients experiencing bleeding events were older (median 77 vs. 74 years, p < 0.001) and had higher prevalence of cerebral infarction (48.6% vs. 29.8%), renal insufficiency (24.8% vs. 10.6%), and prior bleeding history (37.8% vs. 13.5%) (all p < 0.001). Laboratory parameters revealed the bleeding cohort had lower hemoglobin (97 vs. 109 g/L) and platelet counts (142 vs. 161 × 109/L), but elevated white cell counts (12.78 vs. 10.75 × 109/L) and lactate levels (2.30 vs. 1.90 mmol/L). Vital signs showed elevated blood pressures in bleeding group. Medication analysis indicated more frequent antiplatelet use in the bleeding group (44.3% vs. 32.5%, p < 0.001). The training (n = 3,942) and testing (n = 1,690) sets demonstrated balanced characteristics except for bleeding history prevalence (14.7% vs. 17.2%, p = 0.018), suggesting generally representative data partitioning (Table 2).
3.2 Selected predictors and construction model
Variable selection was performed using LASSO regression with tenfold cross-validation, which identified six clinically significant predictors: cerebral infarction, red blood cell count, renal insufficiency, systolic pressure, creatinine, and bleeding history. The regularization path showing coefficient shrinkage is presented in Figure 2A, with optimal lambda selection demonstrated in Figure 2B. The results showed that the included variables had no collinearity in predicting respiratory failure (VIFs <10), and there was a linear relationship with logitp (p > 0.05), suggesting that they could be used to construct a logistic regression model. All selected variables were subsequently incorporated into a multivariable logistic regression model using backward elimination (minimum AIC = 1,972). The final model retained six significant predictors (Table 3 and Figure 2C).
Figure 2. Variable selection was performed using LASSO and logistic regression. (A) Coefficient profile plots were generated against the log(lambda) sequence to visualize the variable selection process and identify nonzero coefficient variables based on the optimal lambda value. (B) Dotted vertical lines represent optimal values determined using the 1 standard error of the minimum criteria (lambda.1se). (C) Forest plot displaying final predictors in the bleeding risk model with adjusted odds ratios from multivariable logistic regression. CI, cerebral infarction; RBC, red blood cell count; RI, renal insufficiency; SP, systolic pressure; CRE, creatinine; BH, bleeding history.
3.3 Model visualization
The final bleeding risk prediction model was operationalized through a clinically deployable nomogram (Figure 3). This visual tool integrates six significant predictors identified during model development. Each predictor is assigned points along scaled axes according to its regression weight. Clinicians sum the points corresponding to a patient’s clinical profile, with the total points axis (0–260 points) providing immediate conversion to predicted bleeding probability (0.1–0.7). For example: A patient with prior bleeding (bleeding history = yes, 37.5 points), cerebral infarction (CI = yes, 16 points), renal insufficiency (RI = yes, 18 points), RBC 2.5 × 1012/L (73 points), systolic pressure 160 mmHg (37 points), and creatinine 500 μmol/L (18 points) would have 199.5 points, corresponding to 44% bleeding risk.
Figure 3. Nomogram for bleeding risk prediction in pulmonary embolism patients. The tool converts six clinical parameters into points: cerebral infarction (Cl), red blood cell count (RBC), renal insufficiency (RI), systolic blood pressure (SP), creatinine (CRE), and bleeding history (BH). Summed points (total points axis) correspond to predicted bleeding probability (bottom axis). Example: A patient with prior bleeding (bleeding history = yes, 37.5 points), cerebral infarction (CI = yes, 16 points), renal insufficiency (RI = yes, 18 points), RBC 2.5 × 1012/L (73 points), systolic pressure 160 mmHg (37 points), and creatinine 500 μmol/L (18 points) would have 199.5 points, corresponding to 44% bleeding risk.
3.4 Model validation
The bleeding risk model demonstrated robust performance in both training and validation cohorts. In Figure 4A, the AUC of the training cohort was 0.756 (95% CI: 0.729–0.784), while in Figure 4B, the AUC of the validation cohort was 0.729 (95% CI: 0.685–0.773). Both significantly exceeded the null hypothesis value of 0.5 (p < 0.001), confirming clinically useful discriminatory power. Calibration curves (Figures 4C,D) illustrate the excellent concordance between the predicted probability of bleeding and the actual observations in the training and validation cohort. Brier scores were low and consistent (training: 0.069; validation: 0.069), indicating stable predictive accuracy. Decision curve analysis demonstrated robust clinical utility across cohorts. In the training cohort, the model provided superior net benefit versus default strategies across threshold probabilities 5–35% (Figure 5A), with optimal clinical utility at 5% risk where net benefit reached 0.52. Validation cohort maintained significant net benefit (Figure 5B), particularly at critical thresholds 5–26% (maximum NB = 0.42 at 10% risk). Clinical impact curves demonstrated consistent risk stratification utility across cohorts. In the training cohort (Figure 5C), at the 30% probability threshold: 31.2% (1,230/3,942) of patients were classified as high-risk, capturing 78.5% (351/447) of bleeding events (sensitivity) with a positive predictive value (PPV) of 28.5% (351/1,230), translating to 1 true positive identified per 3.5 high-risk patients treated. Validation cohort (Figure 5D) analysis confirmed robustness: at 30% threshold, 28.6% (484/1,690) were high-risk, detecting 76.3% (65/85) of bleeding events (PPV = 13.4%), requiring treatment of 7.4 patients per true bleed prevented.
Figure 4. Model performance metrics in development and validation cohorts. Four-panel evaluation of the bleeding risk prediction model. (A) Development ROC: AUC = 0.756 (95% CI: 0.729–0.784). (B) Validation ROC: AUC = 0.729 (95% CI: 0.685–0.773). (C) Development calibration: Ideal fit (slope = 1.000). (D) Validation calibration: Good agreement (slope = 0.810).
Figure 5. Clinical utility and impact analysis. (A) Development DCA: Net benefit of model-guided decisions versus “treat-all” and “treat-none” strategies across threshold probabilities. (B) Validation DCA: Replication of net benefit superiority in independent cohort. (C) Development CIC: Proportion classified high-risk versus actual bleeding events captured. (D) Validation CIC: Replication of clinical impact in independent cohort. In the DCAs, the y-axis represents the net benefit. The horizontal lines labeled “None” represent the assumption that no participant experienced bleeding. The lines labeled “All” represent the assumption that all participants had bleeding. The lines labeled “nomogram model” represent the predictive model developed in this study. In CICs, the red curve represents the number of individuals classified as positive (high risk) by the model at each threshold probability, indicating the number of high-risk individuals. The blue curve represents the number of true positives (individuals with the outcome) at each threshold probability.
3.5 Model compare with single indicator
The nomogram demonstrated superior discriminatory capacity compared to individual predictors in both training and validation cohorts (Figure 6). In the training cohorts (Figure 6A), the nomogram model achieved significantly higher AUC (0.756, 0.729–0.784) than any single predictor (p < 0.01 for all comparisons). Validation cohort results (Figure 6B) confirmed this superiority, model AUC remained robust at 0.729 (95% CI: 0.685–0.773).
Figure 6. ROC curve comparisons. (A) Training cohort. (B) Testing cohort. Nomogram has the maximum AUC. CI, cerebral infarction; RBC, red blood cell count; RI, renal insufficiency; SP, systolic pressure; CRE, creatinine; BH, bleeding history; Nomo, nomogram.
4 Discussion
This study developed and rigorously validated a novel bleeding risk prediction model for PE patients using LASSO regression. Clinicians can utilize this validated model incorporating six key predictors (bleeding history, renal function, RBC count, blood pressure, stroke history, and creatinine levels), which demonstrated reliable risk stratification (development/validation AUCs: 0.756/0.729) and accurate probability estimation. The resultant nomogram provides clinicians with an individualized risk quantification tool that translates complex model outputs into actionable bedside decisions.
In patients with PE undergoing anticoagulation therapy, bleeding is a major complication. Studies show that a history of bleeding is a significant factor influencing the risk of bleeding. In one study, a history of bleeding was identified as a significant risk factor for major bleeding in PE patients receiving thrombolysis (19). Our study reaffirmed the importance of history of bleeding as a critical indicator for assessing bleeding risk. Additionally, renal insufficiency is a key factor affecting the risk of bleeding. Research has found that renal insufficiency, particularly acute kidney injury (AKI) and severe renal insufficiency, is significantly associated with early mortality in acute PE patients (20). Another study found that patients with renal insufficiency have a higher incidence of bleeding events during hospitalization, especially when using conventional doses of low molecular weight heparin (LMWH) (21). Additionally, serum creatinine and estimated glomerular filtration rate (eGFR) are also important indicators of long-term outcomes. Studies indicate that decreased renal function is associated with an increased risk of all-cause mortality 90 days and 1 year after acute PE, underscoring the importance of monitoring renal function in managing patients with PE (22). Our predictive model aligned with previous studies’ findings. RBC is also a factor that influences the risk of bleeding. Studies have shown that red blood cell distribution width (RDW) is significantly associated with the mortality rate in patients with PE, and an elevated RDW may indicate a poor prognosis (23). Our findings indicated that RBC is a significant factor that elevates the risk of bleeding. Additionally, inflammatory markers such as interleukin-8 (IL-8) have been linked to early major bleeding in patients with acute PE, suggesting that these biomarkers may play a role in future risk assessments (24). Research has shown that hypertension and systolic blood pressure play a significant role in the impact on patients with PE. Studies indicate that in hypertensive patients treated with fibrinolytic therapy, systolic blood pressure levels are significantly associated with the occurrence of cerebral hemorrhage (25). Our study uncovered a substantial increase in bleeding risk associated with hypertension. This elevated risk may be attributed to structural changes in blood vessels caused by hypertension, rendering them more susceptible to rupture and ultimately increasing the likelihood of bleeding (26). Artery occlusion cerebral infarction was associated with an elevated risk of hemorrhage transformation (27), consistent with our results and potentially linked to increased antithrombotic medication usage.
When assessing the bleeding risk in patients with PE, using existing scoring systems can be helpful. However, studies have shown that current scoring systems are not sufficiently accurate in predicting early major bleeding in patients with acute PE, necessitating the development of a specific risk scoring system for acute PE (4). Our study delivers three pivotal contributions to personalized PE management: First, we establish the first bleeding risk prediction model specifically derived for PE populations, overcoming critical limitations of generic thrombotic risk tools. By employing LASSO regression to integrate six evidence-based predictors (prior hemorrhage, renal dysfunction, erythrocyte count, systolic hypertension, cerebral infarction, and creatinine), our model addresses the unmet need for PE-specific risk stratification. Second, the clinically deployable nomogram transforms complex algorithmic outputs into immediate, individualized risk quantitation—enabling dynamic optimization of anticoagulation intensity, comorbidity management (e.g., hypertension control), and hematologic parameter correction at point-of-care. Third, decision curve and clinical impact analyses demonstrate significant net benefit improvement across critical thresholds, substantiating its capacity to reduce major bleeding events in high-risk subgroups.
Several limitations merit acknowledgment. First, the 6-month observation period limits assessment of long-term bleeding risk. Second, the single-center retrospective design fundamentally restricts the ability to perform a direct comparison with conventional risk scores, despite our efforts to mitigate bias. Third, and most importantly, the model’s performance in ethnically diverse populations and different healthcare systems is unknown and represents a critical question for future research.
Our study establishes and validates a new pulmonary embolism-specific bleeding risk prediction model. The resultant nomogram demonstrated robust discrimination and calibration, this tool enables personalized anticoagulation intensity adjustment and targeted comorbidity management. Future implementation in multinational pragmatic trials will validate its capacity to improve patient outcomes while reducing healthcare utilization costs.
Data availability statement
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.
Ethics statement
The studies involving humans were approved by Medical Ethics Committee of the Affiliated Dongyang Hospital of Wenzhou Medical University. The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation was not required from the participants or the participants’ legal guardians/next of kin in accordance with the national legislation and institutional requirements.
Author contributions
TY: Validation, Data curation, Writing – original draft, Formal analysis. WL: Software, Writing – original draft, Formal analysis, Data curation, Visualization. MW: Writing – review & editing, Funding acquisition, Conceptualization, Methodology. LX: Methodology, Conceptualization, Writing – review & editing.
Funding
The author(s) declare that financial support was received for the research and/or publication of this article. This research was supported by Zhejiang Provincial Natural Science Foundation of China under Grant No. LTGY23H200002.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The authors declare that no Gen AI was used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
1. Huisman, MV, Barco, S, Cannegieter, SC, Le Gal, G, Konstantinides, SV, Reitsma, PH, et al. Pulmonary embolism. Nat Rev Dis Primers. (2018) 4:18028. doi: 10.1038/nrdp.2018.28
2. Essien, EO, Rali, P, and Mathai, SC. Pulmonary embolism. Med Clin North Am. (2019) 103:549–64. doi: 10.1016/j.mcna.2018.12.013
3. Daley, MJ, Murthy, MS, and Peterson, EJ. Bleeding risk with systemic thrombolytic therapy for pulmonary embolism: scope of the problem. Ther Adv Drug Saf. (2015) 6:57–66. doi: 10.1177/2042098615572333
4. Mathonier, C, Meneveau, N, Besutti, M, Ecarnot, F, Falvo, N, Guillon, B, et al. Available bleeding scoring systems poorly predict major bleeding in the acute phase of pulmonary embolism. J Clin Med. (2021) 10:3615. doi: 10.3390/jcm10163615
5. Tao, Y, Chen, H, Dong, C, Zhang, J, Shi, Y, Xu, X, et al. Performance of bleeding risk scores for major bleeding in anticoagulated patients with pulmonary embolism: insights from the CURES Registry-2. Thromb Haemost. (2025). doi: 10.1055/a-2642-0241
6. Vizzotto, LJH, Sepeda, CDR, and Miranda, CH. Bleeding in patients hospitalized with acute pulmonary embolism in Brazil. Clinics. (2025) 80:100573. doi: 10.1016/j.clinsp.2024.100573
7. Chopard, R, Bertoletti, L, Piazza, G, Jimenez, D, Barillari, G, Llamas, P, et al. External validation of the PE-SARD risk score for predicting early bleeding in acute pulmonary embolism in the RIETE Registry. Thromb Res. (2024) 235:22–31. doi: 10.1016/j.thromres.2024.01.013
8. Badescu, MC, Ciocoiu, M, Badulescu, OV, Vladeanu, MC, Bojan, IB, Vlad, CE, et al. Prediction of bleeding events using the VTE-BLEED risk score in patients with venous thromboembolism receiving anticoagulant therapy (Review). Exp Ther Med. (2021) 22:1344. doi: 10.3892/etm.2021.10779
9. Nishimoto, Y, Yamashita, Y, Morimoto, T, Saga, S, Amano, H, Takase, T, et al. Validation of the VTE-BLEED score's long-term performance for major bleeding in patients with venous thromboembolisms: from the COMMAND VTE registry. J Thromb Haemost. (2020) 18:624–32. doi: 10.1111/jth.14691
10. Kusaba, H, Moriyama, S, Hieda, M, Ito, M, Ohmura, H, Isobe, T, et al. IMPROVE bleeding score predicts major bleeding in advanced gastrointestinal cancer patients with venous thromboembolism. Jpn J Clin Oncol. (2022) 52:1183–90. doi: 10.1093/jjco/hyac103
11. Wang, L, Zhao, L, Li, F, Liu, J, Zhang, L, Li, Q, et al. Risk assessment of venous thromboembolism and bleeding in COVID-19 patients. Clin Respir J. (2022) 16:182–9. doi: 10.1111/crj.13467
12. Grdinic, AG, Radovanovic, S, Gleditsch, J, Jørgensen, CT, Asady, E, Pettersen, HH, et al. Developing a machine learning model for bleeding prediction in patients with cancer-associated thrombosis receiving anticoagulation therapy. J Thromb Haemost. (2024) 22:1094–104. doi: 10.1016/j.jtha.2023.12.034
13. Zhang, L, Ding, YJ, Sun, XW, Lin, YN, Zhou, JP, Li, SQ, et al. Combined aspartate aminotransferase level and PE-SARD score predict 1-month bleeding risk in acute pulmonary embolism. Am J Med Sci. (2023) 366:286–90. doi: 10.1016/j.amjms.2023.07.008
14. Xu, D, Zhou, H, Zhang, T, Gong, W, Zhong, J, Yu, H, et al. Safety of antiplatelet therapy in noncardioembolic ischemic stroke with thrombocytopenia: the CASE II study. J Am Heart Assoc. (2024) 13:e032327. doi: 10.1161/JAHA.123.032327
15. Qian, Y, Wanlin, L, and Maofeng, W. Machine learning derived model for the prediction of bleeding in dual antiplatelet therapy patients. Front Cardiovasc Med. (2024) 11:1402672. doi: 10.3389/fcvm.2024.1402672
16. Chen, T, Lei, W, and Wang, M. Predictive model of internal bleeding in elderly aspirin users using XGBoost machine learning. Risk Manag Healthc Policy. (2024) 17:2255–69. doi: 10.2147/RMHP.S478826
17. Liang, C, Wanling, L, and Maofeng, W. LASSO-derived model for the prediction of bleeding in aspirin users. Sci Rep. (2024) 14:12507. doi: 10.1038/s41598-024-63437-6
18. Jing, J, Wanling, L, and Maofeng, W. A practical nomogram for predicting the bleeding risk in patients with a history of myocardial infarction treating with aspirin. Clin Appl Thromb Hemost. (2024) 30:10760296241262789. doi: 10.1177/10760296241262789
19. Obradovic, S, Subotic, B, Dzudovic, B, Matijasevic, J, Dzudovic, J, Salinger-Martinovic, S, et al. Pulmonary embolism bleeding score index (PEBSI): a new tool for the detection of patients with low risk for major bleeding on thrombolytic therapy. Thromb Res. (2022) 214:138–43. doi: 10.1016/j.thromres.2022.05.002
20. Wang, D, Fan, G, Liu, X, Wu, S, and Zhai, Z. Renal insufficiency and short-term outcomes of acute pulmonary embolism: a systemic review and meta-analysis. Thromb Haemost. (2020) 120:1025–34. doi: 10.1055/s-0040-1712459
21. Wang, D, Fan, G, Lei, J, Yang, Y, Xu, X, Ji, Y, et al. LMWHs dosage and outcomes in acute pulmonary embolism with renal insufficiency, an analysis from a large real-world study. Thromb J. (2022) 20:26. doi: 10.1186/s12959-022-00385-z
22. Ģībietis, V, Kigitoviča, D, Vītola, B, Strautmane, S, and Skride, A. Glomerular filtration rate as a prognostic factor for long-term mortality after acute pulmonary embolism. Med Princ Pract. (2019) 28:264–72. doi: 10.1159/000497436
23. Sen, HS, Abakay, O, Tanrikulu, AC, Sezgi, C, Taylan, M, Abakay, A, et al. Is a complete blood cell count useful in determining the prognosis of pulmonary embolism? Wien Klin Wochenschr. (2014) 126:347–54. doi: 10.1007/s00508-014-0537-1
24. Oblitas, CM, Lago-Rodríguez, MO, López-Rubio, M, García-Gámiz, M, Zamora-Trillo, A, Alvarez-Sala-Walther, LA, et al. Role of cytokines in predicting early major bleeding in patients with acute pulmonary embolism. Eur J Haematol. (2025) 114:847–51. doi: 10.1111/ejh.14387
25. Perini, F, De Boni, A, Marcon, M, Bolgan, I, Pellizzari, M, and Dionisio, LD. Systolic blood pressure contributes to intracerebral haemorrhage after thrombolysis for ischemic stroke. J Neurol Sci. (2010) 297:52–4. doi: 10.1016/j.jns.2010.06.025
26. Markus, HS, and de Leeuw, FE. Cerebral small vessel disease: recent advances and future directions. Int J Stroke. (2023) 18:4–14. doi: 10.1177/17474930221144911
Keywords: pulmonary embolism, bleeding, risk prediction model, nomogram, adverse effects, anticoagulants
Citation: Ye T, Lei W, Wang M and Xu L (2025) Development and validation of a nomogram for predicting bleeding risk in patients with pulmonary embolism. Front. Med. 12:1692156. doi: 10.3389/fmed.2025.1692156
Edited by:
Francisco Epelde, Parc Taulí Foundation, SpainCopyright © 2025 Ye, Lei, Wang and Xu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Maofeng Wang, d3ptY3dtZkB3bXUuZWR1LmNu; Lili Xu, eHVsaWxpNDE2NjNAMTYzLmNvbQ==
†These authors have contributed equally to this work
Tian Ye1†