- 1Department of Pharmacy, Personalized Drug Research and Therapy Key Laboratory of Sichuan Province, Sichuan Provincial People’s Hospital, School of Medicine, University of Electronic Science and Technology of China, Chengdu, China
- 2The Fourth People’s Hospital of Chengdu, Chengdu, China
- 3Department of Pharmacy, Chengdu Women’s and Children’s Central Hospital, School of Medicine, University of Electronic Science and Technology of China, Chengdu, China
Objective: This study utilizes real-world data from primary membranous nephropathy (PMN) patients to preliminarily develop a venous thromboembolism (VTE) risk prediction model with machine learning. The aim is to improve the rational use of prophylactic anticoagulant therapy by predicting VTE risk in these patients.
Methods: We collected diagnostic and treatment data for PMN patients hospitalized at Sichuan Provincial People’s Hospital from 1 January 2018, to 30 September 2024. The data was divided into training and test sets at an 8:2 ratio, followed by processed using combinations of three imputation methods, three sampling methods, and three feature selection methods. After preprocessing, fourteen machine learning algorithms were employed to develop a predictive model for VTE risk in PMN patients. The SHapley Additive exPlanation (SHAP) method was used to interpret the contribution of outcome features. Finally, a VTE risk prediction tool for PMN patients was constructed using Streamlit.
Results: A total of 643 patients with PMN were included in the study, of whom 93 developed VTE. Among the 504 models constructed, the NGBoost model, which incorporated imputation by K-Nearest Neighbor, sampling by Borderline-SMOTE, and feature selection by Frequency-based Selection, was identified as the optimal model, achieving an area under the curve (AUC) of 0.911. The optimal model included ten features: D-dimer (DD), Fibrin Degradation Products (FDP)>5 mg/L, international normalized ratio (INR) of prothrombin, Recurrent nephrotic syndrome (RNS), cholinesterase (CHE), Urinary Microalbumin to Creatinine Ratio (umALB/Ucr), statins, antithrombin III (AT III) activity, albumin, and anti-phospholipase A2 receptor antibody (aPLA2Rab). Finally, an online predictive tool based on the optimal model was developed to provide real-time individualized VTE risk predictions for PMN patients.
Conclusion: This study developed a personalized risk prediction model for VTE in PMN patients using machine learning techniques. Additionally, a web-based tool for this predictive model was created. The model demonstrates strong predictive performance and can assist in clinical decision-making for the prevention and treatment of VTE in PMN patients.
1 Introduction
Membranous Nephropathy (MN) is primarily characterized by proteinuria and hypoalbuminemia, typically progressing to Nephrotic Syndrome (NS). MN can be classified into primary membranous nephropathy (PMN) and secondary membranous nephropathy (SMN) based on etiology, with PMN accounting for 80% of cases (Couser, 2017; Ronco and Debiec, 2015; Dc and Pe, 2017). In China, the proportion of PMN among primary glomerulonephritis cases has risen from 9.89% (1979–2002) to 18.42% (2003–2014), indicating a yearly increase in incidence (Hladunewich et al., 2009; Hou et al., 2018).
It is currently believed that PMN patients experience prolonged heavy proteinuria and hypoalbuminemia, which result in the loss of small-molecule proteins. In response, the liver compensates by synthesizing large-molecule proteins, causing abnormalities such as platelet activation, abnormal coagulation system test indexes, and reduced fibrinolytic system activity. These abnormalities contribute to a hypercoagulable state in PMN patients, making them highly susceptible to VTE (Lionaki et al., 2012; Bierzynska and Saleem, 2017; Cheung et al., 2018).
VTE is one of the most common complications of PMN, encompassing deep vein thrombosis (DVT) and pulmonary embolism (PE), as well as intra-abdominal venous thrombosis (Li et al., 2012). Among these, renal vein thrombosis (RVT) is the most prevalent, with reported incidences varying widely between 29% and 60% across different studies (Velasquez Forero et al., 1988; Singhal and Brimble, 2006; Li et al., 2012). Additionally, the occurrence of VTE is associated with the severity of proteinuria and hypoalbuminemia in PMN patients and typically manifests early in the disease course (Lionaki et al., 2012; Vestergaard et al., 2022). During the progression of PMN, the risk of VTE rapidly increases from less than 1% at the disease onset to 7%, particularly when serum albumin (ALB) levels are ≤28 g/L. Below this threshold, for every 10 g/L decrease in serum ALB, the risk of thromboembolic events nearly doubles (Lionaki et al., 2012; Couser, 2017). Research shows a markedly elevated risk of thromboembolism early in the PMN disease course, especially in the nephrotic state; the VTE incidence was 9.85% within the first 6 months (Mahmoodi et al., 2008).
However, the current predictive indicators for assessing VTE risk in PMN patients are primarily based on serum ALB levels. Recent studies have identified associations between VTE occurrence in PMN patients and factors such as D-dimer levels, proteinuria, anti-phospholipase A2 receptor antibody (aPLA2Rab) levels, and low-density lipoprotein cholesterol (LDL-C) levels (Kumar et al., 2012; Li et al., 2012; Wu et al., 2022; Zhu et al., 2022). Consequently, a scientifically rigorous method for assessing VTE risk in PMN patients remains elusive, and there is a lack of effective predictive models for the risk of VTE formation in these patients. As a core technology of artificial intelligence, machine learning is often employed to construct data analysis and predictive models. This study aims to incorporate multiple types of factors and use machine learning algorithms to develop a VTE risk prediction model for PMN patients. This model could provide crucial support for devising individualized prophylactic anticoagulation treatment strategies for PMN patients.
2 Materials and methods
2.1 Data explanation
The data for this study were obtained from the electronic health record system (EHRS) of Sichuan Provincial People’s Hospital and through telephone follow-up. The study included PMN patients who were hospitalized at Sichuan Provincial People’s Hospital between 1 January 2018, and 30 September 2024. For patients with multiple hospitalization records, in order to achieve the goal of early prediction of VTE risk and to avoid the confounding factors introduced by changes and interventions during multiple treatment processes, this study extracts data records from the patients’ first admission, and in cases where the same indicator is recorded multiple times, the initial record is taken. Inclusion criteria required patients to be diagnosed with MN via renal biopsy or to have a positive aPLA2Rab test. Exclusion criteria were as follows: (1) SMN, including hepatitis B- and C-associated MN, systemic lupus erythematosus-associated MN, thyroiditis-associated MN, tumor-associated MN, Sjögren’s syndrome-associated MN, HIV-associated MN, and ankylosing spondylitis-associated MN; (2) hospitalization due to VTE; (3) diagnosed with VTE within 24 h of hospitalization; (4) death during hospitalization; (5) inability to obtain diagnostic and treatment data; (6) lack of contact information.
The outcome variable of this study is the diagnosis of VTE-related diseases within 6 months of the initial hospitalization, such as VTE, DVT, RVT, and PE, as confirmed by vascular ultrasound examination.
In the course of the study, patients’ private information, such as names, addresses, and contact approaches, was anonymized. This study was conducted in accordance with the Declaration of Helsinki and was approved by the institutional ethics committee.
2.2 Statistical analysis
Continuous variables are expressed as mean ± standard deviation (SD). For variables following a normal distribution, the t-test is used, while the Mann-Whitney U test is employed for non-normally distributed variables. Categorical variables are presented as frequencies and percentages (%), and analyzed using the chi-squared test and Fisher’s exact test. A p-value of <0.05 is considered statistically significant. Statistical analyses are conducted using the stats module in Python 3.8, and model development is performed using the sklearn library in Python 3.8.
2.3 Variable processing
Variables with more than 90% missing values, those where the largest category accounts for more than 90% of records, those with more than 90% maximum category count, and those with a coefficient of variation less than 0.01 were excluded. Binary categorical variables were assigned values of 0 or 1. Ordinal categorical variables were assigned values of 1, 2, 3, and so on. Nominal categorical variables were transformed using one-hot encoding; for variables with more than five categories, consolidation was performed prior to encoding.
Three data imputation methods were employed: (1) Simple Imputation: For continuous variables, the median was used as the imputation value, while for categorical variables, the mode was used. (2) K-Nearest Neighbors (KNN) Imputation: The KNN algorithm was utilized to predict and replace the missing values. (3) Random Forest (RF) Imputation: The RF algorithm was applied to predict and replace the missing values. RF classification models are used for imputing categorical variables, while RF regression models are applied for imputing continuous variables. To prevent data leakage during imputation, all target labels are excluded, and only feature values are retained for the imputation process. After imputation, a consistent 8:2 train-test split was applied to all three datasets using a fixed random seed, ensuring identical samples in each respective set and producing three training sets (n = 514) and three test sets (n = 129).
Three data sampling methods were employed: (1) Synthetic Minority Oversampling Technique (SMOTE): synthesizes and supplements new samples from the minority class using a subset of original data. (2) Borderline-SMOTE (BSMOTE): an enhanced algorithm based on SMOTE that selectively generates new samples only from the minority class samples on the border, thereby improving the distribution of class samples. (3) Adaptive Synthetic (ADASYN): dynamically generates data based on the difficulty of learning from minority class samples. To counteract the continuous values generated during categorical variables imputation, the values were rounded to the nearest categories. The sample sampling was conducted solely on the training sets to allow the models to fully learn minority class characteristics, while the test sets retained their original imbalanced distribution for a realistic performance assessment. This step produced nine training sets.
Three feature screening methods were employed: (1) non-screening: models were built using all variables. (2) Lasso selection: a feature screening method based on linear regression, accurately identifying important variables. (3) LightGBM selection: utilizing the LightGBM algorithm to assess the importance of feature variables and select the top ten ranked features. (4) Frequency-based Selection (Freq-based): After performing feature screening with LASSO and LightGBM, the frequency of each feature’s occurrence was quantified across the 18 generated subset datasets. Subsequently, the top 10 most frequently occurring features were selected to reconstruct a refined feature selection subset. Following feature selection on the training sets, 36 sets were obtained. The test sets were then matched and supplemented accordingly based on the imputation method, retaining only the features present in their corresponding training sets.
2.4 Model construction
Through different data imputation techniques, data sampling, and feature screening, 36 datasets were obtained. 14 machine learning algorithms were employed, including Logistic Regression (LR), Support Vector Machine (SVM), KNN, Decision Tree, Multi-Layer Perceptron (MLP), RF, Extremely Randomized Trees (ExtRa Trees), Adaptive Boosting (AdaBoost), Gradient Boosting (GBoost), Extreme Gradient Boosting (XGBoost), Categorical Boosting (CatBoost), Natural Gradient Boosting (NGBoost), Bernoulli naive Bayes (Ber NB), and Gaussian naive Bayes (Gau NB).
The training set was used for model development and the test set was used for model evaluation. Ten-fold cross-validation was performed on the training set to internally validate the model and assess the stability of machine learning predictions across different data partitions and performance metrics.
2.5 Model evaluation
In this study, performance evaluation metrics of classification models were utilized, including receiver operating characteristic (ROC) curve and its AUC, precision-recall (P-R) curve, calibration curve, calibration slope and intercept, decision curve analysis (DCA), accuracy, precision, recall, F1-score, and Brier score. SHapley Additive exPlanations (SHAP) were employed to elucidate the contributions of variables to the models.
To assess the impact of different sample sizes on model prediction performance, Bootstrapping was conducted by randomly sampling subsets of 10%, 20%, 30%, up to 100% from the training set. These 10 subsets were used iteratively to build models, and their AUC was computed using a test set. This process was repeated 100 times to examine the effect of sample size. 200 bootstrap resampling iterations were performed to evaluate the impact of different methods of data preprocessing, modeling, and viable features. One-way ANOVA was applied in the feature sensitivity analysis.
2.6 Web-based prediction tool
Building upon the aforementioned steps, a final optimal prediction model is obtained and implemented using Streamlit to establish a web-based prediction tool. This platform incorporates multiple predictive indicators highly associated with concurrent VTE in patients with PMN, ultimately providing personalized VTE risk assessment for PMN patients.
3 Results
3.1 Research population
A total of 1,349 records of hospitalized patients diagnosed with MN were extracted and 725 cases remained after excluding duplicate visits. Following inclusion and exclusion criteria screening, SMN patients were excluded, resulting in a final cohort of 643 patients. Among these, 93 cases experienced VTE, while 550 did not. Figure 1 depicts the patient enrollment process. The demographic and clinical characteristics of the patients at baseline are summarized in Supplementary Tables 1–4.
3.2 The results of variable procession
86 indicators were identified for inclusion after data preprocessing, being categorized into nine basic demographic features, 21 comorbidities and medical history details, five medication-related factors, and 51 laboratory test results, as detailed in Supplementary Table 6. The missing performance of each variable is presented in Supplementary Table 7. Supplementary Table 8 lists the medical reference ranges for each variable, while Supplementary Table 9 outlines the specific sub-features encompassed within the merged features, such as anticoagulants, liver diseases, etc.
The initial dataset was processed for missing values using three different imputation methods, followed by dataset splitting, resulting in 36 standardized datasets. The details for each dataset are provided in Table 1.
3.3 Model building
After fitting 36 datasets using 14 machine learning algorithms, we generated a total of 504 VTE risk prediction models for PMN patients. Preliminary assessments of model performance were conducted using 10-fold cross-validation. Selected results from the 10-fold cross-validation of some models are shown in Table 2.
3.4 Model evaluation
The evaluation of model performance was conducted using the test dataset, calculating metrics including AUC, accuracy, precision, recall, F1 score, and Brier score. Among the models evaluated, the NGBoost model (Optimal Model), optimized through KNN imputation, BSMO sampling, and Frequency-based feature selection, demonstrated superior comprehensive performance in both AUC (0.911) and F1-score (0.652). Optimal Model, model_2, and model_3 ranked as the top three in AUC performance, while model_4 exhibited the best performance among classical LR models, model_5 achieved the highest F1-score. Figure 2 displays the ROC curve, P-R curve, calibration curve, and decision curve for the optimal model. The model calibration evaluation resulted in a Brier score of 0.095, indicating good overall accuracy. The calibration intercept was −0.946 and the slope was 0.556, reflecting minor over-prediction for high-risk cases and under-prediction for low-risk cases. Nevertheless, the model demonstrated good discrimination and acceptable calibration. SHAP values were used to explain the contributions of variables to the models. Figure 3 illustrates the relationship between the values of each variable in the final predictive model and the corresponding changes in SHAP values. Figure 4 displays the ranking of variable importance in the final predictive model. The results indicate that D-dimer, FDP > 5 mg/L, INR, RNS, and CHE are the five most critical variables. In the predictive model, higher SHAP values for a variable suggest a greater possibility of VTE occurrence. Figure 5 compares the performance of our optimal model against the traditional Padua score using ROC curves and confusion matrices. As shown in Figure 6, the AUC value of the test set increases with the sample size, and the curve gradually flattens, demonstrating the robustness and persuasiveness of our study. A one-way ANOVA was performed based on 200 bootstrap replicates from the test set to evaluate the effects of different data preprocessing methods, algorithmic models, and features included in the optimal model. Table 3 summarizes the results. In terms of AUC, all factors except the sampling method showed statistically significant effects on model performance. The NGBoost algorithm achieved the highest AUC value. For feature sensitivity analysis, each feature was individually excluded, and the resulting model was compared against the optimal model via bootstrap resampling. As shown in Table 4, excluding DD significantly reduced the AUC from 0.896 ± 0.013 to 0.856 ± 0.016 (p < 0.01). In contrast, excluding Statins, ALB, or AT III activity led to a slight increase in AUC (p < 0.01), suggesting potential feature redundancy or noise associated with these variables in this study.
Figure 2. The Characteristic Curves of The Optimal Model A, the ROC curve; B, the P-R curve; C, the calibration curve; D, the clinical decision curve.
Figure 3. The SHAP Diagram of The Optimal Model ALB, albumin; aPLA2Rab, anti-phospholipase A2 receptor antibody; AT III, antithrombin III; CHE, cholinesterase; DD, D-dimer; INR, international normalized ratio; umALB/Ucr, Urinary Microalbumin to Creatinine Ratio; PT, prothrombin time; RNS, relapse of nephrotic syndrome Each dot represents a sample, with red indicating higher variable values and blue indicating lower variable values. For instance, as shown in the figure, elevated DD levels may be associated with an increased risk of VTE.
Decision curve analysis (DCA) assesses the clinical value of a prediction model by estimating its net benefit across various decision thresholds. The threshold probability is the minimum probability of a positive outcome at which a clinician would initiate an intervention (e.g., a threshold of 0.3 implies initiating prophylactic anticoagulation if the predicted probability is ≥0.3). In our analysis, the decision curve exceeded the “treat all” and “treat none” benchmarks for threshold probabilities between 0.2 and 0.9. This demonstrates that following the model’s recommendations provides a superior net benefit, quantifying the clinical gains after weighing the risks and benefits, compared to blanket clinical strategies over this probability range.
The optimal model was compared with the Padua score, a common VTE risk assessment tool for medical inpatients, using the test set from this study. Patients with a Padua score ≥4 were classified as high-risk. In the ROC comparison, the optimal model’s curve remained above that of the Padua score, with AUCs of 0.911 and 0.564, respectively. The confusion matrices further highlight that the performance gap primarily stems from the identification of high-risk patients. The optimal model identified 15 true positives, compared to only six by the Padua score.
Stratified sampling with replacement was performed multiple times to draw subsamples representing varying proportions (10%–100%) of the total sample size. The AUC was calculated for each subsample. The mean AUC and standard deviation were then derived across all iterations at each sampling proportion. Error bars in Figure 6 represent the standard deviation of the AUC estimates at their respective sampling proportion.
3.5 Web-based prediction tool
Based on the aforementioned results, the functionality of rapidly constructing web pages using Streamlit was utilized to develop the optimal model of this study into a web-based prediction tool. The input and output interfaces of this tool are depicted in Figures 7, 8. The model has been deployed on a cloud server and is publicly accessible via the following link: https://predict-pmn-s-vte-risk-arvbrnedu7g8ehqolwpje2.streamlit.app/.
4 Discussion
PMN accounts for approximately 80% of cases of MN and is a common renal disease characterized by prominent symptoms of proteinuria and hypoalbuminemia (Couser, 2017). The incidence of PMN in China has been increasing annually (Hou et al., 2018). VTE represents one of the most frequent complications of PMN, encompassing RVT, DVT, and PE. VTE often presents insidiously and poses serious risks, potentially exacerbating the condition of PMN patients, necessitating appropriate VTE prevention strategies. Key to VTE prevention is accurate risk prediction and the implementation of rational preventive measures. Currently, no specific VTE risk prediction model exists for PMN patients. Therefore, this study aims to construct a VTE risk prediction model using machine learning algorithms for PMN patients, providing essential groundwork for prophylactic anticoagulation strategies.
This study investigated 86 characteristic variables among 643 PMN patients. A total of 36 datasets were generated using three data imputation methods, three data sampling methods, and three feature selection methods. Subsequently, 504 models were constructed using 14 machine learning algorithms. Univariate analysis revealed substantial impacts of data processing and modeling strategies on model performance. For data imputation, the KNN method achieved marginally higher AUC and F1-score than RF and simple mean imputation, suggesting its relative advantage in preserving data structure. Among sampling strategies, ADA and BSMOTE showed comparable discriminative ability, with only minor differences in recall and F1-score, indicating limited sensitivity to sampling techniques. Feature selection markedly influenced outcomes, where the frequent feature method outperformed alternatives (p < 0.001), supporting the value of retaining stable, high-frequency variables. Among machine learning algorithms, NGBoost, CatBoost, and RF delivered the strongest results, with NGBoost attaining the highest AUC (0.881 ± 0.004) and the lowest Brier score, highlighting its capacity for modeling complex relationships and providing well-calibrated probabilistic predictions. Overall, this study demonstrates that appropriate data preprocessing, feature selection, and algorithm choice critically influence the performance of VTE risk prediction models. In feature sensitivity analyses, we observed that exclusion of specific variables (such as statin use, ALB, and AT-III) slightly improved discriminative performance (e.g., AUC and F1-score), suggesting that these variables may introduce redundancy or noise, thereby increasing model variance and limiting generalizability. Nevertheless, given their established clinical relevance and plausibility, we retained these features in the final model to maintain interpretability and clinical completeness. These findings indicate that despite satisfactory overall performance, some feature redundancy or multicollinearity may persist. Future studies should further optimize and validate the model using larger, multi-center datasets.
For the risk prediction of VTE formation, current clinical practice employs VTE scoring systems. Examples include the (Caprini et al., 1991; Barbar et al., 2010), and (Khorana et al., 2008) scores. However, the Caprini and Khorana scores are generally not applicable to PMN patients, and the Padua score does not incorporate PMN-specific predictors for VTE risk, such as hypoalbuminemia, proteinuria, and aPLA2Rab. The Padua score, refined by Barbar et al., in 2010 and validated in medical inpatients, remains one of the most widely used VTE risk assessment tools in clinical practice. Given that PMN patients align closely with the target population for this score, we compared our model against it. Results demonstrated superior performance of our optimal model in the test set. This finding is consistent with a study of 3,277 patients hospitalized with acute exacerbation of COPD, where the Padua score also underperformed relative to the Caprini score (AUC:0.644 ± 0.023 vs.0.713 ± 0.021, p = 0.029) (Zhou et al., 2022). Yang et al. reported similar findings. Padua score demonstrates limited predictive performance in PMN patients, which primarily stems from the score’s lack of specificity for this population. The 11 Padua items do not capture several PMN-specific risk factors, such as the disease itself being an independent VTE risk factor, along with hypoalbuminemia, heavy proteinuria, and hyperlipidemia. Furthermore, certain items such as “reduced mobility” often do not align with the clinical status of PMN patients. The high weight assigned to this item further reduces the score’s applicability. Additionally, some PMN patients with severe hypoalbuminemia receive prophylactic anticoagulation, which may alter their actual VTE risk. In summary, the limited clinical relevance of several key Padua items in the PMN population contributes to its suboptimal predictive performance. Another significant study was conducted by Lee et al., in 2014 (Lee et al., 2014), who developed a Markov model integrating serum albumin levels, bleeding risks, and acceptable risk-benefit ratios to quantify the potential benefit of prophylactic anticoagulation with warfarin in MN patients. It is important to note that this model is applicable only to MN patients receiving warfarin and incorporates serum albumin levels as the sole risk factor for VTE. Additionally, some scholars argue that the HAS-BLED score may be more suitable for assessing bleeding risk (Li et al., 2023). These factors significantly limit its generalizability and external applicability. Furthermore, clinical decision algorithms published to guide prophylactic anticoagulation in MN similarly rely solely on serum albumin levels to assess VTE risk (Hofstra and Wetzels, 2016; Lin et al., 2020; Rovin et al., 2021). This study focusses on VTE risk in MN patients, addressing key issues in prophylactic anticoagulation by integrating multiple factors and overcoming limitations associated with anticoagulant use.
The optimal model ultimately incorporated ten clinical indicators associated with VTE in PMN, including DD, FDP >5 mg/L, AT-Ⅲ, ALB, and six additional validated predictors. This suggests that in making decisions regarding prophylactic anticoagulation therapy for PMN patients, not only ALB levels but also the aforementioned indicators may be valuable considerations. The elevation of DD levels is closely associated with a hypercoagulable state and increased fibrinolysis in the body (Tripodi, 2011). SHAP results indicate that lower levels of DD are associated with lower risks of VTE, which aligns with findings from Li’s study (Li et al., 2012). As a terminal product of fibrinolysis, FDP becomes elevated during thrombosis and dissolution, similar to DD. Catalina Filip et al. reported higher FDP levels in pregnant women with DVT compared to non-DVT controls in the third trimester (Filip et al., 2025). Similarly, Zheng et al. identified FDP as an independent risk factor and incorporated it into a nomogram for preoperative DVT risk in lower limb fracture patients (Zheng et al., 2024). Consistent with these findings, SHAP analysis in our model indicated that FDP levels >5 mg/L contributed to increased predicted VTE risk. The SHAP analysis similarly underscore the significant role of umALB/Ucr in the model’s predictions: higher levels are associated with an increased predicted risk of VTE. This finding is consistent with the large-scale cohort study by Massicotte-Azarniouch et al., which reported that even in individuals with normal renal function, heavy albuminuria significantly elevates the risk of VTE and was identified as an independent risk factor (Massicotte-Azarniouch et al., 2017). On the other hand, SHAP analysis suggested statin use was associated with higher predicted VTE risk. This likely reflects confounding by indication, patients already prescribed statins before admission may have had higher baseline risk. While medication history is essential for modeling, such reverse causality should be considered in interpretation. AT-III is the principal synthesized component of the anticoagulant system in the liver, providing 70% of the body’s anticoagulant activity. SHAP analysis indicates that decreased AT-III activity is associated with an increased risk of VTE. Huang et al. found that patients with PMN in a hypercoagulable state have lower AT-III activity compared to those without hypercoagulability (Huang et al., 2016). Serum albumin levels are recognized as a risk factor for VTE in PMN patients. Consistently, the SHAP analysis in this study suggests that low serum albumin levels are similarly associated with a heightened risk of VTE.
5 Limitations
This study is a single-center retrospective study, and the retrospective design inherently introduces certain biases, particularly in the accuracy of data recording and patient selection. First, the retrospective nature of data collection lacks real-time precision and may result in incomplete or inaccurate information, especially in the absence of standardized recording procedures. Although various measures were implemented during the data preprocessing to mitigate these biases, such as feature selection and techniques for handling missing data, we acknowledge that these efforts cannot entirely eliminate all limitations. In the collection of key variables, we encountered the absence of critical risk factor data, such as the Leiden factor, total plasma protein S, and prothrombin gene mutations. Such deficiency could potentially compromise the accuracy of the model’s predictions. Additionally, data selection was constrained by the retrospective nature of the study. The majority of patients had multiple clinical visits on record. Given that this study aims to predict risk and implement early VTE prevention interventions while accounting for confounding factors, we chose to include only the data from patients’ initial visits. The rationale is that first-visit data more accurately reflect patients’ real health status before substantial medical intervention. This decision, while helpful in reducing confounding factors arising from variations and interventions during treatment, also introduces a series of limitations, particularly in terms of data integrity and the comprehensiveness of disease progression information. For example, it fails to reflect the evolution of the patient’s condition, lacks in-depth assessment of the patient’s health status, and does not incorporate adjustments in treatment strategies into the model developing, thereby affecting the reliability and accuracy of the model’s predictions. Finally, the sample size in this study was relatively limited, with only 643 PMN patients included, among whom just 93 were VTE-positive. Small sample size and class imbalance pose challenges to the model’s ability to handle rare events and conduct subgroup analyses. Although techniques such as SMOTE and Borderline-SMOTE were applied to address class imbalance, the limited sample size may still constrain the model’s generalizability in clinical practice. Moreover, as this study employed a single-center retrospective design, the data were restricted to cases and patient characteristics from a specific medical center, potentially limiting the external validity and generalizability of the findings. The model’s generalization capability has not been extensively validated due to the singularity of the data source, and its applicability in different populations and healthcare settings requires further investigation. Data from a single center may not fully represent all patient groups, particularly those from diverse regions, ethnicities, or healthcare systems. The model specifically predicts symptomatic VTE risk, as only clinically symptomatic cases underwent confirmatory imaging. Asymptomatic VTE cases may have been undetected due to the absence of systematic screening.
To address the current limitations of the study and further validate the model’s performance, we propose several directions for future work. First, designing and conducting prospective studies. Compared to retrospective studies, prospective studies provide more accurate data by allowing standardized and on-demand data collection from the outset, thereby reducing the risks of missing data and information bias, which enhances the reliability of research findings and the precision of model predictions. Second, delving into the methods for processing and integrating multiple hospitalization records by developing time-series models or dynamic prediction models to meticulously consider detailed information from each patient encounter, constructing a continuous risk prediction trajectory for patients, and monitoring the temporal variations in patients’ VTE risks. Third, collaborating with other healthcare institutions to establish a multi-center database. By incorporating data from diverse regions and populations, this strategy significantly increases sample size, improves the study’s statistical ability, and enables a more comprehensive evaluation of the model’s generalizability. It also ensures its applicability and effectiveness across various healthcare settings. Through these efforts, we aim to overcome the limitations of single-center, retrospective studies and provide robust evidence for the application of the risk prediction model in real-world clinical practice, ultimately contributing to better patient outcomes.
6 Conclusion
This study utilized a single-center dataset, employing advanced statistical methods and machine learning techniques to develop and evaluate a personalized VTE risk prediction model for PMN patients. A web-based prediction tool was also created. As a single-center retrospective study, certain limitations exist, including a relatively small sample size, imbalanced sample categories, and missing data. However, data preprocessing measures such as feature selection and sample resampling were employed to significantly reduce the biases introduced by these limitations. Internal validation of the model demonstrated good predictive performance, suggesting its potential to assist in VTE prevention and management for PMN patients. Looking forward, we aim to address these limitations by conducting prospective trials or collaborating with other institutions to construct multi-center datasets, thereby paving the way for further development and broader application of the model.
Author contributions
LL: Investigation, Writing – review and editing, Validation, Writing – original draft, Visualization, Software, Data curation, Methodology, Formal Analysis. LW: Validation, Resources, Investigation, Writing – original draft, Visualization. YW: Data curation, Methodology, Writing – review and editing. HW: Writing – review and editing, Resources, Investigation, Data curation, Supervision. XZ: Visualization, Software, Writing – review and editing, Investigation. LH: Resources, Writing – review and editing, Investigation. QY: Software, Supervision, Writing – review and editing. XiW: Writing – review and editing, Project administration, Resources. YB: Funding acquisition, Writing – review and editing, Conceptualization, Resources.
Funding
The author(s) declare that financial support was received for the research and/or publication of this article. This work has been funded by National Key Research and Development Program of China (2020YFC2005500), the Natural Science Foundation of Sichuan Province (2022NSFSC0818), the Hospital Pharmacy Committee of the Chinese Pharmaceutical Association (CPA-Z05-ZC-2023-002), China International Medical Foundation (Z-2021-6-2101-2023) and Chinese Research Hospital Association (Y2022FH-YWPJ01-202).
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The author(s) declare that no Generative AI was used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fphar.2025.1683708/full#supplementary-material
References
Barbar, S., Noventa, F., Rossetto, V., Ferrari, A., Brandolin, B., Perlati, M., et al. (2010). A risk assessment model for the identification of hospitalized medical patients at risk for venous thromboembolism: the padua prediction score. J. Thrombosis Haemostasis JTH 8, 2450–2457. doi:10.1111/j.1538-7836.2010.04044.x
Bierzynska, A., and Saleem, M. (2017). Recent advances in understanding and treating nephrotic syndrome. F1000Res 6, 121. doi:10.12688/f1000research.10165.1
Caprini, J. A., Arcelus, J. I., Hasty, J. H., Tamhane, A. C., and Fabrega, F. (1991). Clinical assessment of venous thromboembolic risk in surgical patients. Semin. Thromb. Hemost. 17 (Suppl. 3), 304–312.
Cheung, K. L., Zakai, N. A., Callas, P. W., Howard, G., Mahmoodi, B. K., Peralta, C. A., et al. (2018). Mechanisms and mitigating factors for venous thromboembolism in chronic kidney disease: the REGARDS study. J. Thromb. Haemost. 16, 1743–1752. doi:10.1111/jth.14235
Couser, W. G. (2017). Primary membranous nephropathy. Clin. J. Am. Soc. Nephrol. 12, 983–997. doi:10.2215/CJN.11761116
Dc, C., and Pe, B. (2017). Membranous nephropathy: integrating basic science into improved clinical management. Kidney Int. 91, 566–574. doi:10.1016/j.kint.2016.09.048
Filip, C., Matasariu, D. R., Ursache, A., Dimitriu, C. D., Filip, C., Boiculese, V. L., et al. (2025). Exploring biomarkers to predict thrombogenic risk in pregnancy. J. Clin. Med. 14, 932. doi:10.3390/jcm14030932
Hladunewich, M. A., Troyanov, S., Calafati, J., and Cattran, D. C. (2009). The natural history of the non-nephrotic membranous nephropathy patient. Clin. J. Am. Soc. Nephrol. 4, 1417–1422. doi:10.2215/CJN.01330209
Hofstra, J. M., and Wetzels, J. F. M. (2016). Should aspirin be used for primary prevention of thrombotic events in patients with membranous nephropathy? Kidney Int. 89, 981–983. doi:10.1016/j.kint.2016.01.019
Hou, J.-H., Zhu, H.-X., Zhou, M.-L., Le, W.-B., Zeng, C.-H., Liang, S.-S., et al. (2018). Changes in the spectrum of kidney diseases: an analysis of 40,759 biopsy-proven cases from 2003 to 2014 in China. Kidney Dis. (Basel) 4, 10–19. doi:10.1159/000484717
Huang, M., Wei, R., Li, Q., Yang, X., Cao, C., Su, T., et al. (2016). Hypercoagulable state evaluated by thromboelastography in patients with idiopathic membranous nephropathy. J. Thromb. Thrombolysis 41, 321–327. doi:10.1007/s11239-015-1247-x
Khorana, A. A., Kuderer, N. M., Culakova, E., Lyman, G. H., and Francis, C. W. (2008). Development and validation of a predictive model for chemotherapy-associated thrombosis. Blood 111, 4902–4907. doi:10.1182/blood-2007-10-116327
Kumar, S., Chapagain, A., Nitsch, D., and Yaqoob, M. M. (2012). Proteinuria and hypoalbuminemia are risk factors for thromboembolic events in patients with idiopathic membranous nephropathy: an observational study. BMC Nephrol. 13, 107. doi:10.1186/1471-2369-13-107
Lee, T., Biddle, A. K., Lionaki, S., Derebail, V. K., Barbour, S. J., Tannous, S., et al. (2014). Personalized prophylactic anticoagulation decision analysis in patients with membranous nephropathy. Kidney Int. 85, 1412–1420. doi:10.1038/ki.2013.476
Li, S.-J., Guo, J.-Z., Zuo, K., Zhang, J., Wu, Y., Zhou, C., et al. (2012). Thromboembolic complications in membranous nephropathy patients with nephrotic syndrome-a prospective study. Thromb. Res. 130, 501–505. doi:10.1016/j.thromres.2012.04.015
Li, X., Xie, X., Zhao, Y., Wang, G., Shao, H., and Zhang, X. (2023). Some points for the KDIGO 2021 guideline for prophylactic anticoagulation in membranous nephropathy: is it clear enough for us to follow? Nephron 147, 193–198. doi:10.1159/000525913
Lin, R., McDonald, G., Jolly, T., Batten, A., and Chacko, B. (2020). A systematic review of prophylactic anticoagulation in nephrotic syndrome. Kidney Int. Rep. 5, 435–447. doi:10.1016/j.ekir.2019.12.001
Lionaki, S., Derebail, V. K., Hogan, S. L., Barbour, S., Lee, T., Hladunewich, M., et al. (2012). Venous thromboembolism in patients with membranous nephropathy. Clin. J. Am. Soc. Nephrol. 7, 43–51. doi:10.2215/CJN.04250511
Mahmoodi, B. K., ten Kate, M. K., Waanders, F., Veeger, N. J. G. M., Brouwer, J.-L. P., Vogt, L., et al. (2008). High absolute risks and predictors of venous and arterial thromboembolic events in patients with nephrotic syndrome: results from a large retrospective cohort study. Circulation 117, 224–230. doi:10.1161/CIRCULATIONAHA.107.716951
Massicotte-Azarniouch, D., Bader Eddeen, A., LazoLanger, A., Molnar, A. O., Lam, N. N., McCallum, M. K., et al. (2017). Risk of venous thromboembolism in patients by Albuminuria and estimated GFR. Am. J. Kidney Dis. 70, 826–833. doi:10.1053/j.ajkd.2017.07.003
Ronco, P., and Debiec, H. (2015). Pathophysiological advances in membranous nephropathy: time for a shift in patient’s care. Lancet 385, 1983–1992. doi:10.1016/S0140-6736(15)60731-0
Rovin, B. H., Adler, S. G., Barratt, J., Bridoux, F., Burdge, K. A., Chan, T. M., et al. (2021). Executive summary of the KDIGO 2021 Guideline for the Management of Glomerular Diseases. Kidney Int. 100, 753–779. doi:10.1016/j.kint.2021.05.015
Singhal, R., and Brimble, K. S. (2006). Thromboembolic complications in the nephrotic syndrome: pathophysiology and clinical management. Thromb. Res. 118, 397–407. doi:10.1016/j.thromres.2005.03.030
Tripodi, A. (2011). D-dimer testing in laboratory practice. Clin. Chem. 57, 1256–1262. doi:10.1373/clinchem.2011.166249
Velasquez Forero, F., Garcia Prugue, N., and Ruiz Morales, N. (1988). Idiopathic nephrotic syndrome of the adult with asymptomatic thrombosis of the renal vein. Am. J. Nephrol. 8, 457–462. doi:10.1159/000167654
Vestergaard, S. V., Birn, H., Darvalics, B., Nitsch, D., Sørensen, H. T., and Christiansen, C. F. (2022). Risk of arterial thromboembolism, venous thromboembolism, and bleeding in patients with nephrotic syndrome: a population-based cohort Study. Am. J. Med. 135, 615–625.e9. doi:10.1016/j.amjmed.2021.11.018
Wu, H. H. L., Alozai, A., Li, J. W. C., Elmowafy, A., Ponnusamy, A., Woywodt, A., et al. (2022). Risk factors of venous thromboembolism in anti-PLA2R-positive and negative primary membranous nephropathy. Clin. Kidney J. 15, 1636–1638. doi:10.1093/ckj/sfac052
Zheng, F., Chen, X., Huang, J., and Lin, C. (2024). Study on risk factors of preoperative deep vein thrombosis in patients with lower limb fractures and construction and validation of risk prediction nomogram model. BMC Surg. 24, 408. doi:10.1186/s12893-024-02718-3
Zhou, C., Yi, Q., Ge, H., Wei, H., Liu, H., Zhang, J., et al. (2022). Validation of risk assessment models predicting venous thromboembolism in inpatients with acute exacerbation of chronic obstructive pulmonary disease: a multicenter cohort Study in China. Thromb. Haemost. 122, 1177–1185. doi:10.1055/a-1693-0063
Zhu, H., Xu, L., Liu, X., Liu, B., Zhai, C., Wang, R., et al. (2022). Anti-PLA2R antibody measured by ELISA predicts the risk of vein thrombosis in patients with primary membranous nephropathy. Ren. Fail 44, 594–600. doi:10.1080/0886022X.2022.2057861
Glossary
AdaBoost Adaptive Boosting
ADASYN Adaptive Synthetic
AFP alpha fetoprotein
A/G albumin/globulin
ALB albumin
ALT alanine transaminase
ALP alkaline phosphatase
APTR activated partial thromboplastin ratio
aPLA2Rab anti-phospholipase A2 receptor antibody
AST aspartate aminotransferase
AT III antithrombin III
AUC area under the curve
Ber NB Bernoulli naive Bayes
BNP brain natriuretic peptide
CA carbohydrate antigens
CatBoost Categorical Boosting
CHE cholinesterase
CK 19 cytokeratin 19
CK-MB creatine kinase isoenzyme MB
Cl chlorine
DCA decision curve analysis
DD D-dimer
DVT deep venous thrombosis
eGFR estimated glomerular filtration rate
EHRS electronic health record system
EO% eosinophil ratio
ESR erythrocyte sedimentation rate
ExtRa Trees Extremely Randomized Trees
FDP fibrinogen degradation products
FOB fecal occult blood
FIB fibrinogen
Gau NB Gaussian naive Bayes
GBoost Gradient Boosting
GLB globulin
Hcy homocysteine
hs-CRP hypersensitive-C reactive protein
INR international normalized ratio
KNN K-Nearest Neighbor
LDL-C low-density lipoprotein cholesterol
LightGBM Light Gradient Boosting Machine
Lp(a) lipoprotein a
LR logistic regression
MCV mean volume of red blood cells
MLP multi-layer perceptron
MN membranous nephropathy
MYO myoglobin
NE neutrophilic granulocyte
NEUT% neutrophil ratio
NGBoost Natural Gradient Boosting
PCT procalcitonin
PE pulmonary embolism
PLT platelet count
PMN primary membranous nephropathy
P-R Precision-Recall
PT prothrombin time
RF random forest
ROC receiver operating characteristic
RNS relapse of nephrotic syndrome
RVT renal vein thrombosis
SD standard deviation
SHAP shapley additive explanations
SMOTE synthetic minority oversampling Technique
BSMOTE borderline-SMOTE
SVM support vector machine
TP total protein
TT thrombin time
UA uric acid
uALB urinary albumin
Ucr urine creatinine
umALB urinary microalbumin
VTE venous thromboembolism
WBC white blood cell
XGBoost extreme Gradient Boosting.
Keywords: primary membranous nephropathy, venous thromboembolism, riskfactors, machine learning, prediction model
Citation: Li L, Wu L, Wang Y, Wang H, Zheng X, Han L, Yin Q, Wu X and Bian Y (2025) Development of a venous thromboembolism risk prediction model for patients with primary membranous nephropathy based on machine learning. Front. Pharmacol. 16:1683708. doi: 10.3389/fphar.2025.1683708
Received: 21 August 2025; Accepted: 20 October 2025;
Published: 06 November 2025.
Edited by:
Marco Allinovi, Careggi University Hospital, ItalyReviewed by:
Osama Nady Mohamed, Minia University, EgyptArwa Jalal Eddine, Hôpital Foch, France
Tiffany Pon, University of California San Francisco, United States
Copyright © 2025 Li, Wu, Wang, Wang, Zheng, Han, Yin, Wu and Bian. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Yuan Bian, Ymlhbnl1YW41NjdAMTI2LmNvbQ==; Xingwei Wu, d3V4aW5nd2VpQG1lZC51ZXN0Yy5lZHUuY24=; Qinan Yin, NTIyOTIyNjMzQHFxLmNvbQ==
†These authors share first authorship
Hulin Wang1†