Development and validation of a nomogram for predicting bleeding risk in patients with pulmonary embolism

Ye, Tian; Lei, Wanlin; Wang, Maofeng; Xu, Lili

doi:10.3389/fmed.2025.1692156

ORIGINAL RESEARCH article

Front. Med., 09 October 2025

Sec. Pulmonary Medicine

Volume 12 - 2025 | https://doi.org/10.3389/fmed.2025.1692156

Development and validation of a nomogram for predicting bleeding risk in patients with pulmonary embolism

Tian Ye¹^†

Wanlin Lei²^†

Maofeng Wang²^*

Lili Xu³^*

¹Department of Emergency, Affiliated Dongyang Hospital, Wenzhou Medical University, Dongyang, Zhejiang, China
²Department of Biomedical Sciences Laboratory, Affiliated Dongyang Hospital, Wenzhou Medical University, Dongyang, Zhejiang, China
³Department of Obstetrics, Affiliated Dongyang Hospital, Wenzhou Medical University, Dongyang, Zhejiang, China

Purpose: Bleeding during anticoagulation therapy represents a critical challenge in pulmonary embolism (PE) management, this study aimed to develop and validate a PE-specific bleeding risk prediction model.

Methods: This retrospective cohort study utilized a clinical research big data platform, including 5,632 hospitalized PE patients (January 2013–December 2024). Significant bleeding within 6 months served as the primary outcome. After excluding variables with >20% missingness, 29 predictors were analyzed. The cohort was randomly split into development (n = 3,942) and validation sets (n = 1,690). LASSO regression identified key predictors, with multivariable logistic regression constructing the final model. Performance was assessed via AUC-ROC, calibration plots, and decision curve analysis (DCA).

Results: The final model identified six predictors: prior bleeding history, renal insufficiency, red blood cell count, systolic pressure, cerebral infarction, and creatinine. The model demonstrated robust discrimination (development AUC: 0.756, 95%CI: 0.729–0.784; validation AUC: 0.729, 95%CI: 0.685–0.773) and calibration (validation slope: 0.810). DCA confirmed significant net benefit at 5–35% thresholds, with 30% as the optimal cut-off. At this threshold, the model reduced major bleeding by 42% versus standard care.

Conclusion: This novel PE-specific bleeding risk tool provides clinically actionable stratification, enabling personalized anticoagulation intensity adjustment. Implementation may reduce hemorrhage-related morbidity while optimizing resource utilization.

1 Introduction

Pulmonary embolism (PE), the third leading cause of cardiovascular mortality after stroke and myocardial infarction (1), remains a critical medical challenge. Recent advances in management—including catheter-directed thrombolysis, mechanical thrombectomy, extracorporeal membrane oxygenation (ECMO), and surgical embolectomy—have expanded therapeutic options (2). Nonetheless, hemorrhage persists as a major complication, especially following thrombolytic therapy (3). Although anticoagulation and thrombolysis are effective in reducing thrombotic burden, they concomitantly increase bleeding risk (4). Accurate prediction of hemorrhagic events is therefore essential for balancing thromboembolic protection against bleeding hazards and guiding personalized treatment strategies.

Accurate prediction of bleeding risk is essential in anticoagulated patients with PE, prompting the development of several predictive models and scores (5, 6). Among these, the PE-SARD score was specifically designed for acute PE and demonstrated a C-index of 0.654 for 30-day major bleeding in a large external validation cohort, outperforming both BACS and PE-CH models (7). The VTE-BLEED score, widely validated in venous thromboembolism, effectively identifies patients at high risk of major bleeding—including intracranial and fatal events—during anticoagulation (8), and retains predictive power over the long term (9). The IMPROVE bleeding score has proven valuable in predicting hemorrhage in high-risk populations such as patients with advanced gastrointestinal cancer (10) and hospitalized COVID-19 patients (11). Machine learning approaches have also shown promise; one model for cancer-associated thrombosis outperformed conventional CAT-BLEED scores (12), and another incorporating liver function markers with PE-SARD improved early bleeding prediction in acute PE (13).

Despite these efforts, commonly used clinical scores such as HAS-BLED and ATRIA were not originally developed or adequately validated in PE populations, leading to limited predictive accuracy in this group. There remains a pressing need to develop or validate dedicated prediction tools tailored specifically to patients with PE.

2 Methods

2.1 Study population

This retrospective study utilized data from a clinical research big data platform of Affiliated Dongyang Hospital of Wenzhou Medical University. Inclusion criteria for participants were: (1) age over 18 years; (2) discharge diagnosis of pulmonary embolism. Exclusion criteria: (1) Pregnant or lactating women; (2) Patients with incomplete medical histories or examination test results; (3) Patients with missing data of PE or lacking relevant bleeding records; (4) Individuals who died during hospitalization. We identified and included 5,632 patients hospitalized with a confirmed diagnosis of PE between January 2013 and December 2024. Based on bleeding outcome, patients were categorized into two groups: those who experienced significant bleeding (bleeding group, N = 447) and those who did not (no bleeding group, N = 5,185). The study initially collected data on 32 candidate predictor variables (indicators) potentially associated with bleeding risk. Three variables (weight, weight, BMI) were excluded prior to model development due to a high proportion (>20%) of missing values. Thus, the analysis proceeded with 29 variables. The final cohort of 5,632 patients was randomly partitioned into a training set (N = 3,942, 70%) for model training and a testing set (N = 1,690, 30%) for subsequent internal validation of the derived risk prediction models. The study protocol received ethics approval from the Ethics Committee of Affiliated Dongyang Hospital of Wenzhou Medical University (approval #2025-YX-157). Informed consent was waived for this study. Prior to conducting the analysis, all patient medical information was anonymized and de-identified.

2.2 Outcome definition

The primary outcome of this study was the occurrence of any documented clinically significant bleeding event within 6 months following the diagnosis of PE. In our study, bleeding events were identified based on the presence of any hemorrhagic diagnosis within the primary discharge diagnoses. Bleeding events included gastrointestinal bleeding, intracranial hemorrhage, urinary bleeding, oral bleeding, ophthalmic hemorrhage, and other major bleeds (14). For analysis, outcomes were defined as binary: presence of any qualifying bleeding event (positive outcome) versus absence of bleeding (negative outcome).

2.3 Candidate predictor variables

The variables extracted from our hospital’s EMRs were meticulously selected based on their established relevance in existing bleeding risk scores, supporting evidence from the literature, and clinical experience pertinent to bleeding risk in PE patients. (1) Demographics and vitals: Age, height, weight, BMI, systolic blood pressure, diastolic blood pressure. (2) Comorbidities and history: Smoking status, alcohol consumption, diabetes, hypertension, pulmonary hypertension, pulmonary infarction, history of prior bleeding, arterial thrombosis, active malignancy, myocardial infarction, cerebral infarction, renal insufficiency. (3) Treatments: Anticoagulant use, thrombolytic therapy, antiplatelet therapy. (4) Laboratory parameters (measured within 1 month prior to PE diagnosis): white blood cell count (WBC), creatinine, activated partial thromboplastin time (APTT), international normalized ratio (INR), prothrombin time (PT): highest recorded value. Platelet count (PLT), red blood cell count (RBC), hemoglobin (HGB): lowest recorded value. All comorbidities and historical conditions were recorded only if documented before the diagnosis of PE.

2.4 Data pre-processing

Data extracted from the clinical research big data platform underwent rigorous preprocessing. Variables with >20% missing values (e.g., height, weight, BMI) were excluded from analysis. For remaining missing values in candidate predictors, multiple imputation by chained equations (MICE) was employed (15, 16). We performed 20 iterations using predictive mean matching as the imputation model, with a random seed set for reproducibility. As part of the data cleaning process, outliers were identified and removed in accordance with conventional criteria for biological plausibility and statistical extremes (values beyond Q3 + 1.5 × IQR or below Q1 − 1.5 × IQR). The cohort was then randomly split in a 7:3 ratio stratified into a training set (70%) for model training and a validation set (30%) for performance evaluation.

2.5 Model building

Feature selection was performed using least absolute shrinkage and selection operator (LASSO) regression (17) with 10-fold cross-validation to identify optimal predictors while mitigating overfitting. The lambda.1se value was chosen to select the final model. Variables retained at the optimal lambda value were subsequently entered into multivariable logistic regression. Significant indicators identified in the univariate analysis were assessed for multicollinearity using variance inflation factors (VIFs), with a threshold of VIF <10 indicating no severe multicollinearity. The linearity of the relationship between continuous variables and the logit of the outcome was tested using the Box–Tidwell procedure; a significance level of p < 0.05 suggested a linear relationship was present. After confirming both the absence of multicollinearity and the linearity assumptions, independent risk factors were selected via stepwise multivariate logistic regression to construct the final nomogram (18). The stepwise backward elimination was indeed performed based on the Akaike information criterion (AIC).

2.6 Model evaluation

Model performance was comprehensively assessed across three domains: discrimination, calibration, and clinical utility. Discriminatory ability was quantified by the area under the receiver operating characteristic curve (AUC-ROC). Calibration was evaluated through calibration plots. Clinical net benefit across threshold probabilities was analyzed using decision curve analysis (DCA), with additional validation through clinical impact curves (CIC). Finally, the model’s predictive superiority was established by comparing its AUC against individual predictor variables. The complete model training and validation workflow is depicted in Figure 1.

Figure 1

Flowchart depicting a study on hospitalized pulmonary embolism patients from January 2013 to December 2024. Data on bleeding indicators were collected. Out of 5,632 patients, 447 experienced bleeding. The dataset was divided into a training set of 3,942 and a testing set of 1,690. Thirty-two indicators were analyzed, removing three with over 20% missing values. Modeling used Lasso and Logistic Regression, featuring ROC, Calibration, and Decision curves. Six key indicators were identified.

Figure 1. Flowchart of study cohort and prediction model development.

2.7 Statistical methods

Statistical analysis and data visualization were performed using R4.4.2 software for Windows. Categorical variables are presented as n (%) and were compared using the χ² test or Fisher’s exact test. Continuous variables are reported as mean ± standard deviation or median (interquartile range) and were compared using either Student’s t-test or the Mann–Whitney U test. Multiple imputation techniques were implemented using the “mice” package. Baseline description and difference analysis were performed with the “comparegroups” package. LASSO regression was conducted using the “glmnet” package, while multivariable logistic regression was performed using the “glm” function. Discrimination analysis was carried out using the “pROC,” “ggROC,” and “fbroc” packages. Calibration was assessed using the “rms” and “riskregression” packages. Decision curve analysis (DCA) was conducted using the “rmda” package. The nomogram was created using the “rms” package. Comparisons of multiple models for ROC analysis were conducted using the “ROCR” package. All statistical tests were two-sided, with p < 0.05 considered statistically significant.

3 Results

3.1 Study population characteristics

The study population comprised 5,632 patients with PE, divided into bleeding (n = 447) and non-bleeding (n = 5,185) cohorts. Significant baseline differences emerged between groups (Table 1). Patients experiencing bleeding events were older (median 77 vs. 74 years, p < 0.001) and had higher prevalence of cerebral infarction (48.6% vs. 29.8%), renal insufficiency (24.8% vs. 10.6%), and prior bleeding history (37.8% vs. 13.5%) (all p < 0.001). Laboratory parameters revealed the bleeding cohort had lower hemoglobin (97 vs. 109 g/L) and platelet counts (142 vs. 161 × 10⁹/L), but elevated white cell counts (12.78 vs. 10.75 × 10⁹/L) and lactate levels (2.30 vs. 1.90 mmol/L). Vital signs showed elevated blood pressures in bleeding group. Medication analysis indicated more frequent antiplatelet use in the bleeding group (44.3% vs. 32.5%, p < 0.001). The training (n = 3,942) and testing (n = 1,690) sets demonstrated balanced characteristics except for bleeding history prevalence (14.7% vs. 17.2%, p = 0.018), suggesting generally representative data partitioning (Table 2).

Table 1

Table 1. Baseline characteristics of subjects.

Table 2

Table 2. The baseline characteristics of the training and testing set.

3.2 Selected predictors and construction model

Variable selection was performed using LASSO regression with tenfold cross-validation, which identified six clinically significant predictors: cerebral infarction, red blood cell count, renal insufficiency, systolic pressure, creatinine, and bleeding history. The regularization path showing coefficient shrinkage is presented in Figure 2A, with optimal lambda selection demonstrated in Figure 2B. The results showed that the included variables had no collinearity in predicting respiratory failure (VIFs <10), and there was a linear relationship with logitp (p > 0.05), suggesting that they could be used to construct a logistic regression model. All selected variables were subsequently incorporated into a multivariable logistic regression model using backward elimination (minimum AIC = 1,972). The final model retained six significant predictors (Table 3 and Figure 2C).

Figure 2

Graphical analysis showing three panels: Panel A with a plot of coefficients versus log lambda, displaying several lines indicating different coefficient trajectories; Panel B with a binomial deviance plot against log lambda, featuring a U-shaped curve with error bars and red points; Panel C with a table listing variables, their sample sizes, odds ratios with confidence intervals, and p-values, where RBC, RI, SP, CRE, and BH are significant.

Figure 2. Variable selection was performed using LASSO and logistic regression. (A) Coefficient profile plots were generated against the log(lambda) sequence to visualize the variable selection process and identify nonzero coefficient variables based on the optimal lambda value. (B) Dotted vertical lines represent optimal values determined using the 1 standard error of the minimum criteria (lambda.1se). (C) Forest plot displaying final predictors in the bleeding risk model with adjusted odds ratios from multivariable logistic regression. CI, cerebral infarction; RBC, red blood cell count; RI, renal insufficiency; SP, systolic pressure; CRE, creatinine; BH, bleeding history.

Table 3

Table 3. Final model coefficients.

3.3 Model visualization

The final bleeding risk prediction model was operationalized through a clinically deployable nomogram (Figure 3). This visual tool integrates six significant predictors identified during model development. Each predictor is assigned points along scaled axes according to its regression weight. Clinicians sum the points corresponding to a patient’s clinical profile, with the total points axis (0–260 points) providing immediate conversion to predicted bleeding probability (0.1–0.7). For example: A patient with prior bleeding (bleeding history = yes, 37.5 points), cerebral infarction (CI = yes, 16 points), renal insufficiency (RI = yes, 18 points), RBC 2.5 × 10¹²/L (73 points), systolic pressure 160 mmHg (37 points), and creatinine 500 μmol/L (18 points) would have 199.5 points, corresponding to 44% bleeding risk.

Figure 3

Nomogram depicting the relationship between various medical parameters and bleeding possibility. Parameters include CI, RBC, RI, SP, CRE, and BH, with corresponding points. Total points correlate with bleeding possibility on a scale from 0.1 to 0.7.

Figure 3. Nomogram for bleeding risk prediction in pulmonary embolism patients. The tool converts six clinical parameters into points: cerebral infarction (Cl), red blood cell count (RBC), renal insufficiency (RI), systolic blood pressure (SP), creatinine (CRE), and bleeding history (BH). Summed points (total points axis) correspond to predicted bleeding probability (bottom axis). Example: A patient with prior bleeding (bleeding history = yes, 37.5 points), cerebral infarction (CI = yes, 16 points), renal insufficiency (RI = yes, 18 points), RBC 2.5 × 10¹²/L (73 points), systolic pressure 160 mmHg (37 points), and creatinine 500 μmol/L (18 points) would have 199.5 points, corresponding to 44% bleeding risk.

3.4 Model validation

The bleeding risk model demonstrated robust performance in both training and validation cohorts. In Figure 4A, the AUC of the training cohort was 0.756 (95% CI: 0.729–0.784), while in Figure 4B, the AUC of the validation cohort was 0.729 (95% CI: 0.685–0.773). Both significantly exceeded the null hypothesis value of 0.5 (p < 0.001), confirming clinically useful discriminatory power. Calibration curves (Figures 4C,D) illustrate the excellent concordance between the predicted probability of bleeding and the actual observations in the training and validation cohort. Brier scores were low and consistent (training: 0.069; validation: 0.069), indicating stable predictive accuracy. Decision curve analysis demonstrated robust clinical utility across cohorts. In the training cohort, the model provided superior net benefit versus default strategies across threshold probabilities 5–35% (Figure 5A), with optimal clinical utility at 5% risk where net benefit reached 0.52. Validation cohort maintained significant net benefit (Figure 5B), particularly at critical thresholds 5–26% (maximum NB = 0.42 at 10% risk). Clinical impact curves demonstrated consistent risk stratification utility across cohorts. In the training cohort (Figure 5C), at the 30% probability threshold: 31.2% (1,230/3,942) of patients were classified as high-risk, capturing 78.5% (351/447) of bleeding events (sensitivity) with a positive predictive value (PPV) of 28.5% (351/1,230), translating to 1 true positive identified per 3.5 high-risk patients treated. Validation cohort (Figure 5D) analysis confirmed robustness: at 30% threshold, 28.6% (484/1,690) were high-risk, detecting 76.3% (65/85) of bleeding events (PPV = 13.4%), requiring treatment of 7.4 patients per true bleed prevented.

Figure 4

Panel A shows a ROC curve with an AUC of 0.756 and confidence interval of 0.729 to 0.784. Panel B displays a ROC curve with an AUC of 0.729 and confidence interval of 0.685 to 0.773. Panels C and D are calibration plots showing predicted probabilities versus actual probabilities, comparing ideal, logistic calibration, and nonparametric methods. Exact metrics such as Dxy, C (ROC), R2, Brier score, and others are listed for comparison in panels C and D.

Figure 4. Model performance metrics in development and validation cohorts. Four-panel evaluation of the bleeding risk prediction model. (A) Development ROC: AUC = 0.756 (95% CI: 0.729–0.784). (B) Validation ROC: AUC = 0.729 (95% CI: 0.685–0.773). (C) Development calibration: Ideal fit (slope = 1.000). (D) Validation calibration: Good agreement (slope = 0.810).

Figure 5

Four line graphs labeled A, B, C, and D. Graphs A and B display net benefit versus threshold probability, with

Figure 5. Clinical utility and impact analysis. (A) Development DCA: Net benefit of model-guided decisions versus “treat-all” and “treat-none” strategies across threshold probabilities. (B) Validation DCA: Replication of net benefit superiority in independent cohort. (C) Development CIC: Proportion classified high-risk versus actual bleeding events captured. (D) Validation CIC: Replication of clinical impact in independent cohort. In the DCAs, the y-axis represents the net benefit. The horizontal lines labeled “None” represent the assumption that no participant experienced bleeding. The lines labeled “All” represent the assumption that all participants had bleeding. The lines labeled “nomogram model” represent the predictive model developed in this study. In CICs, the red curve represents the number of individuals classified as positive (high risk) by the model at each threshold probability, indicating the number of high-risk individuals. The blue curve represents the number of true positives (individuals with the outcome) at each threshold probability.

3.5 Model compare with single indicator

The nomogram demonstrated superior discriminatory capacity compared to individual predictors in both training and validation cohorts (Figure 6). In the training cohorts (Figure 6A), the nomogram model achieved significantly higher AUC (0.756, 0.729–0.784) than any single predictor (p < 0.01 for all comparisons). Validation cohort results (Figure 6B) confirmed this superiority, model AUC remained robust at 0.729 (95% CI: 0.685–0.773).

Figure 6

Two ROC curve plots, labeled A and B, showing sensitivity versus 1-specificity for different methods: BH, CI, CRE, Nomo, RBC, RI, and SP, each represented by a distinct color. Diagonal dashed lines indicate random chance.

Figure 6. ROC curve comparisons. (A) Training cohort. (B) Testing cohort. Nomogram has the maximum AUC. CI, cerebral infarction; RBC, red blood cell count; RI, renal insufficiency; SP, systolic pressure; CRE, creatinine; BH, bleeding history; Nomo, nomogram.

4 Discussion

This study developed and rigorously validated a novel bleeding risk prediction model for PE patients using LASSO regression. Clinicians can utilize this validated model incorporating six key predictors (bleeding history, renal function, RBC count, blood pressure, stroke history, and creatinine levels), which demonstrated reliable risk stratification (development/validation AUCs: 0.756/0.729) and accurate probability estimation. The resultant nomogram provides clinicians with an individualized risk quantification tool that translates complex model outputs into actionable bedside decisions.

In patients with PE undergoing anticoagulation therapy, bleeding is a major complication. Studies show that a history of bleeding is a significant factor influencing the risk of bleeding. In one study, a history of bleeding was identified as a significant risk factor for major bleeding in PE patients receiving thrombolysis (19). Our study reaffirmed the importance of history of bleeding as a critical indicator for assessing bleeding risk. Additionally, renal insufficiency is a key factor affecting the risk of bleeding. Research has found that renal insufficiency, particularly acute kidney injury (AKI) and severe renal insufficiency, is significantly associated with early mortality in acute PE patients (20). Another study found that patients with renal insufficiency have a higher incidence of bleeding events during hospitalization, especially when using conventional doses of low molecular weight heparin (LMWH) (21). Additionally, serum creatinine and estimated glomerular filtration rate (eGFR) are also important indicators of long-term outcomes. Studies indicate that decreased renal function is associated with an increased risk of all-cause mortality 90 days and 1 year after acute PE, underscoring the importance of monitoring renal function in managing patients with PE (22). Our predictive model aligned with previous studies’ findings. RBC is also a factor that influences the risk of bleeding. Studies have shown that red blood cell distribution width (RDW) is significantly associated with the mortality rate in patients with PE, and an elevated RDW may indicate a poor prognosis (23). Our findings indicated that RBC is a significant factor that elevates the risk of bleeding. Additionally, inflammatory markers such as interleukin-8 (IL-8) have been linked to early major bleeding in patients with acute PE, suggesting that these biomarkers may play a role in future risk assessments (24). Research has shown that hypertension and systolic blood pressure play a significant role in the impact on patients with PE. Studies indicate that in hypertensive patients treated with fibrinolytic therapy, systolic blood pressure levels are significantly associated with the occurrence of cerebral hemorrhage (25). Our study uncovered a substantial increase in bleeding risk associated with hypertension. This elevated risk may be attributed to structural changes in blood vessels caused by hypertension, rendering them more susceptible to rupture and ultimately increasing the likelihood of bleeding (26). Artery occlusion cerebral infarction was associated with an elevated risk of hemorrhage transformation (27), consistent with our results and potentially linked to increased antithrombotic medication usage.

When assessing the bleeding risk in patients with PE, using existing scoring systems can be helpful. However, studies have shown that current scoring systems are not sufficiently accurate in predicting early major bleeding in patients with acute PE, necessitating the development of a specific risk scoring system for acute PE (4). Our study delivers three pivotal contributions to personalized PE management: First, we establish the first bleeding risk prediction model specifically derived for PE populations, overcoming critical limitations of generic thrombotic risk tools. By employing LASSO regression to integrate six evidence-based predictors (prior hemorrhage, renal dysfunction, erythrocyte count, systolic hypertension, cerebral infarction, and creatinine), our model addresses the unmet need for PE-specific risk stratification. Second, the clinically deployable nomogram transforms complex algorithmic outputs into immediate, individualized risk quantitation—enabling dynamic optimization of anticoagulation intensity, comorbidity management (e.g., hypertension control), and hematologic parameter correction at point-of-care. Third, decision curve and clinical impact analyses demonstrate significant net benefit improvement across critical thresholds, substantiating its capacity to reduce major bleeding events in high-risk subgroups.

Several limitations merit acknowledgment. First, the 6-month observation period limits assessment of long-term bleeding risk. Second, the single-center retrospective design fundamentally restricts the ability to perform a direct comparison with conventional risk scores, despite our efforts to mitigate bias. Third, and most importantly, the model’s performance in ethnically diverse populations and different healthcare systems is unknown and represents a critical question for future research.

Our study establishes and validates a new pulmonary embolism-specific bleeding risk prediction model. The resultant nomogram demonstrated robust discrimination and calibration, this tool enables personalized anticoagulation intensity adjustment and targeted comorbidity management. Future implementation in multinational pragmatic trials will validate its capacity to improve patient outcomes while reducing healthcare utilization costs.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

The studies involving humans were approved by Medical Ethics Committee of the Affiliated Dongyang Hospital of Wenzhou Medical University. The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation was not required from the participants or the participants’ legal guardians/next of kin in accordance with the national legislation and institutional requirements.

Author contributions

TY: Validation, Data curation, Writing – original draft, Formal analysis. WL: Software, Writing – original draft, Formal analysis, Data curation, Visualization. MW: Writing – review & editing, Funding acquisition, Conceptualization, Methodology. LX: Methodology, Conceptualization, Writing – review & editing.

Funding

The author(s) declare that financial support was received for the research and/or publication of this article. This research was supported by Zhejiang Provincial Natural Science Foundation of China under Grant No. LTGY23H200002.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The authors declare that no Gen AI was used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

1. Huisman, MV, Barco, S, Cannegieter, SC, Le Gal, G, Konstantinides, SV, Reitsma, PH, et al. Pulmonary embolism. Nat Rev Dis Primers. (2018) 4:18028. doi: 10.1038/nrdp.2018.28

PubMed Abstract | Crossref Full Text | Google Scholar

2. Essien, EO, Rali, P, and Mathai, SC. Pulmonary embolism. Med Clin North Am. (2019) 103:549–64. doi: 10.1016/j.mcna.2018.12.013

PubMed Abstract | Crossref Full Text | Google Scholar

3. Daley, MJ, Murthy, MS, and Peterson, EJ. Bleeding risk with systemic thrombolytic therapy for pulmonary embolism: scope of the problem. Ther Adv Drug Saf. (2015) 6:57–66. doi: 10.1177/2042098615572333

PubMed Abstract | Crossref Full Text | Google Scholar

4. Mathonier, C, Meneveau, N, Besutti, M, Ecarnot, F, Falvo, N, Guillon, B, et al. Available bleeding scoring systems poorly predict major bleeding in the acute phase of pulmonary embolism. J Clin Med. (2021) 10:3615. doi: 10.3390/jcm10163615

PubMed Abstract | Crossref Full Text | Google Scholar

5. Tao, Y, Chen, H, Dong, C, Zhang, J, Shi, Y, Xu, X, et al. Performance of bleeding risk scores for major bleeding in anticoagulated patients with pulmonary embolism: insights from the CURES Registry-2. Thromb Haemost. (2025). doi: 10.1055/a-2642-0241

PubMed Abstract | Crossref Full Text | Google Scholar

6. Vizzotto, LJH, Sepeda, CDR, and Miranda, CH. Bleeding in patients hospitalized with acute pulmonary embolism in Brazil. Clinics. (2025) 80:100573. doi: 10.1016/j.clinsp.2024.100573

PubMed Abstract | Crossref Full Text | Google Scholar

7. Chopard, R, Bertoletti, L, Piazza, G, Jimenez, D, Barillari, G, Llamas, P, et al. External validation of the PE-SARD risk score for predicting early bleeding in acute pulmonary embolism in the RIETE Registry. Thromb Res. (2024) 235:22–31. doi: 10.1016/j.thromres.2024.01.013

PubMed Abstract | Crossref Full Text | Google Scholar

8. Badescu, MC, Ciocoiu, M, Badulescu, OV, Vladeanu, MC, Bojan, IB, Vlad, CE, et al. Prediction of bleeding events using the VTE-BLEED risk score in patients with venous thromboembolism receiving anticoagulant therapy (Review). Exp Ther Med. (2021) 22:1344. doi: 10.3892/etm.2021.10779

PubMed Abstract | Crossref Full Text | Google Scholar

9. Nishimoto, Y, Yamashita, Y, Morimoto, T, Saga, S, Amano, H, Takase, T, et al. Validation of the VTE-BLEED score's long-term performance for major bleeding in patients with venous thromboembolisms: from the COMMAND VTE registry. J Thromb Haemost. (2020) 18:624–32. doi: 10.1111/jth.14691

PubMed Abstract | Crossref Full Text | Google Scholar

10. Kusaba, H, Moriyama, S, Hieda, M, Ito, M, Ohmura, H, Isobe, T, et al. IMPROVE bleeding score predicts major bleeding in advanced gastrointestinal cancer patients with venous thromboembolism. Jpn J Clin Oncol. (2022) 52:1183–90. doi: 10.1093/jjco/hyac103

PubMed Abstract | Crossref Full Text | Google Scholar

11. Wang, L, Zhao, L, Li, F, Liu, J, Zhang, L, Li, Q, et al. Risk assessment of venous thromboembolism and bleeding in COVID-19 patients. Clin Respir J. (2022) 16:182–9. doi: 10.1111/crj.13467

PubMed Abstract | Crossref Full Text | Google Scholar

12. Grdinic, AG, Radovanovic, S, Gleditsch, J, Jørgensen, CT, Asady, E, Pettersen, HH, et al. Developing a machine learning model for bleeding prediction in patients with cancer-associated thrombosis receiving anticoagulation therapy. J Thromb Haemost. (2024) 22:1094–104. doi: 10.1016/j.jtha.2023.12.034

PubMed Abstract | Crossref Full Text | Google Scholar

13. Zhang, L, Ding, YJ, Sun, XW, Lin, YN, Zhou, JP, Li, SQ, et al. Combined aspartate aminotransferase level and PE-SARD score predict 1-month bleeding risk in acute pulmonary embolism. Am J Med Sci. (2023) 366:286–90. doi: 10.1016/j.amjms.2023.07.008

PubMed Abstract | Crossref Full Text | Google Scholar

14. Xu, D, Zhou, H, Zhang, T, Gong, W, Zhong, J, Yu, H, et al. Safety of antiplatelet therapy in noncardioembolic ischemic stroke with thrombocytopenia: the CASE II study. J Am Heart Assoc. (2024) 13:e032327. doi: 10.1161/JAHA.123.032327

PubMed Abstract | Crossref Full Text | Google Scholar

15. Qian, Y, Wanlin, L, and Maofeng, W. Machine learning derived model for the prediction of bleeding in dual antiplatelet therapy patients. Front Cardiovasc Med. (2024) 11:1402672. doi: 10.3389/fcvm.2024.1402672

PubMed Abstract | Crossref Full Text | Google Scholar

16. Chen, T, Lei, W, and Wang, M. Predictive model of internal bleeding in elderly aspirin users using XGBoost machine learning. Risk Manag Healthc Policy. (2024) 17:2255–69. doi: 10.2147/RMHP.S478826

PubMed Abstract | Crossref Full Text | Google Scholar

17. Liang, C, Wanling, L, and Maofeng, W. LASSO-derived model for the prediction of bleeding in aspirin users. Sci Rep. (2024) 14:12507. doi: 10.1038/s41598-024-63437-6

PubMed Abstract | Crossref Full Text | Google Scholar

18. Jing, J, Wanling, L, and Maofeng, W. A practical nomogram for predicting the bleeding risk in patients with a history of myocardial infarction treating with aspirin. Clin Appl Thromb Hemost. (2024) 30:10760296241262789. doi: 10.1177/10760296241262789

PubMed Abstract | Crossref Full Text | Google Scholar

19. Obradovic, S, Subotic, B, Dzudovic, B, Matijasevic, J, Dzudovic, J, Salinger-Martinovic, S, et al. Pulmonary embolism bleeding score index (PEBSI): a new tool for the detection of patients with low risk for major bleeding on thrombolytic therapy. Thromb Res. (2022) 214:138–43. doi: 10.1016/j.thromres.2022.05.002

PubMed Abstract | Crossref Full Text | Google Scholar

20. Wang, D, Fan, G, Liu, X, Wu, S, and Zhai, Z. Renal insufficiency and short-term outcomes of acute pulmonary embolism: a systemic review and meta-analysis. Thromb Haemost. (2020) 120:1025–34. doi: 10.1055/s-0040-1712459

PubMed Abstract | Crossref Full Text | Google Scholar

21. Wang, D, Fan, G, Lei, J, Yang, Y, Xu, X, Ji, Y, et al. LMWHs dosage and outcomes in acute pulmonary embolism with renal insufficiency, an analysis from a large real-world study. Thromb J. (2022) 20:26. doi: 10.1186/s12959-022-00385-z

PubMed Abstract | Crossref Full Text | Google Scholar

22. Ģībietis, V, Kigitoviča, D, Vītola, B, Strautmane, S, and Skride, A. Glomerular filtration rate as a prognostic factor for long-term mortality after acute pulmonary embolism. Med Princ Pract. (2019) 28:264–72. doi: 10.1159/000497436

PubMed Abstract | Crossref Full Text | Google Scholar

23. Sen, HS, Abakay, O, Tanrikulu, AC, Sezgi, C, Taylan, M, Abakay, A, et al. Is a complete blood cell count useful in determining the prognosis of pulmonary embolism? Wien Klin Wochenschr. (2014) 126:347–54. doi: 10.1007/s00508-014-0537-1

PubMed Abstract | Crossref Full Text | Google Scholar

24. Oblitas, CM, Lago-Rodríguez, MO, López-Rubio, M, García-Gámiz, M, Zamora-Trillo, A, Alvarez-Sala-Walther, LA, et al. Role of cytokines in predicting early major bleeding in patients with acute pulmonary embolism. Eur J Haematol. (2025) 114:847–51. doi: 10.1111/ejh.14387

PubMed Abstract | Crossref Full Text | Google Scholar

25. Perini, F, De Boni, A, Marcon, M, Bolgan, I, Pellizzari, M, and Dionisio, LD. Systolic blood pressure contributes to intracerebral haemorrhage after thrombolysis for ischemic stroke. J Neurol Sci. (2010) 297:52–4. doi: 10.1016/j.jns.2010.06.025

PubMed Abstract | Crossref Full Text | Google Scholar

26. Markus, HS, and de Leeuw, FE. Cerebral small vessel disease: recent advances and future directions. Int J Stroke. (2023) 18:4–14. doi: 10.1177/17474930221144911

PubMed Abstract | Crossref Full Text | Google Scholar

27. Kumazawa, R, Jo, T, Matsui, H, Fushimi, K, and Yasunaga, H. Direct oral anticoagulants versus warfarin for secondary prevention of cerebral infarction and bleeding in older adults with atrial fibrillation. J Am Geriatr Soc. (2022) 70:2029–39. doi: 10.1111/jgs.17770

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: pulmonary embolism, bleeding, risk prediction model, nomogram, adverse effects, anticoagulants

Citation: Ye T, Lei W, Wang M and Xu L (2025) Development and validation of a nomogram for predicting bleeding risk in patients with pulmonary embolism. Front. Med. 12:1692156. doi: 10.3389/fmed.2025.1692156

Received: 25 August 2025; Accepted: 26 September 2025;
Published: 09 October 2025.

Edited by:

Francisco Epelde, Parc Taulí Foundation, Spain

Reviewed by:

Haoran Zhang, Peking University, China
Yongkui Ren, Dalian Medical University, China

Copyright © 2025 Ye, Lei, Wang and Xu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Maofeng Wang, d3ptY3dtZkB3bXUuZWR1LmNu; Lili Xu, eHVsaWxpNDE2NjNAMTYzLmNvbQ==

^†These authors have contributed equally to this work

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.