Predicting the risk of postoperative avascular necrosis in patients with talar fractures based on an interpretable machine learning model

Zhang, Jian; Xu, Jihai; Yu, Jiapei; Chen, Hong; Hong, Xin; Zhang, Songou; Wang, Xin; Shen, Chengchun

doi:10.3389/fbioe.2025.1644261

ORIGINAL RESEARCH article

Front. Bioeng. Biotechnol., 31 July 2025

Sec. Biomechanics

Volume 13 - 2025 | https://doi.org/10.3389/fbioe.2025.1644261

This article is part of the Research Topic: Enhancing Sports Injury Management through Medical-Engineering Innovations

Jian Zhang^1,2,3^†

Jihai Xu^2,4^†

Jiapei Yu^1,2,5^†

Hong Chen^2,5

Xin Hong³

Songou Zhang⁶*

Xin Wang^2,4*

Chengchun Shen^1,2,5*

¹Department of Orthopaedics, Ningbo No.6 Hospital, Ningbo, China
²Ningbo Clinical Research Center for Orthopedics, Sports Medicine and Rehabilitation, Ningbo, China
³Department of Orthopedics, Zhongda Hospital of Southeast University, Nanjing, China
⁴Department of Plastic Reconstructive Surgery and Hand Microsurgery, Ningbo No.6 Hospital, Ningbo, China
⁵Department of Foot and Ankle Surgery, Ningbo No.6 Hospital, Ningbo, China
⁶Department of Clinical Medicine, Health Science Center, Ningbo University, Ningbo, China

Purpose: This study aims to develop and validate an interpretable machine learning model for predicting avascular necrosis (AVN) following talar fracture, thereby aiding in personalized prevention and treatment.

Methods: A retrospective cohort study included patients undergoing surgical intervention for talar fractures at Ningbo No.6 Hospital between January 2018 and December 2023. Multidimensional data encompassing demographic characteristics, fracture-related variables, surgery-related parameters, and follow-up information were collected. Patients were randomly allocated to the training and testing sets in a 7:3 ratio. Potential risk factors for postoperative AVN were screened using univariate and multivariate logistic regression analyses. Six machine learning algorithms were employed to construct the prediction models. The performance of the prediction model was evaluated utilizing metrics including area under the receiver operating characteristic curve (AUC), calibration curves, decision curve analysis (DCA), accuracy, sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), precision, recall, and F1 score. The SHapley Additive exPlanations (SHAP) provided global and local explanations for the optimal model.

Results: A total of 207 patients with talar fractures were enrolled, of whom 45 (21.74%) developed AVN and 162 (78.26%) did not. Univariate and multivariable logistic regression identified six independent risk factors: body mass index (BMI), fracture classification, concomitant ipsilateral foot and ankle fractures, smoking, quality of fracture reduction, and fracture type. Performance evaluation demonstrated that the Extreme Gradient Boosting (XGBoost) model achieved high AUC values with superior specificity and sensitivity in both the training and testing sets. SHAP analysis was used to visualize the relative importance of features within the model and to illustrate the impact of each feature on individual patient outcomes.

Conclusion: This study successfully developed and validated an interpretable machine learning model incorporating key clinical and surgical variables to predict AVN following talar fractures. The prediction model identified high-risk patients and critical modifiable factors, facilitating personalized prevention strategies to mitigate this severe complication.

1 Introduction

Talar fractures are relatively rare injuries, accounting for 0.1%–2.5% of all fractures and 3%–5% of foot and ankle fractures (Saravi et al., 2021). Despite advancements in the diagnosis and treatment of talar fractures, complication rates remain high, and functional outcomes are generally unsatisfactory (Choi et al., 2023). The unique anatomical structure of the talus, characterized by retrograde blood supply via the tarsal canal artery, minimal ligament and tendon attachments, and limited non-articular surfaces resulting in poor vascularization, predisposes it to vascular compromise following high-energy trauma (Kubisa et al., 2024). Consequently, avascular necrosis (AVN) is one of the major complications in patients with talar fractures, with an incidence rate as high as 31.2% (Dodd and Lefaivre, 2015). Patients with early-stage AVN are usually asymptomatic. As a result, the majority of patients present to the clinic at a late stage with long-term functional impairment that significantly disrupts their quality of life and ultimately necessitates interventions such as ankle arthrodesis or joint replacement. Therefore, early prediction and identification of risk factors for AVN following talar fractures are critical to optimizing treatment strategies and improving patient outcomes.

Previous studies have reported traditional risk factors for AVN following talar fractures, such as high body mass index (BMI), which increases local mechanical stress on the talus, and tobacco smoking, which impairs local blood supply (Alley et al., 2024). However, most studies do not assess risk factors comprehensively, and simple risk factor analysis has limited clinical application. In addition, radiographic examinations such as computed tomography (CT) and magnetic resonance imaging (MRI) are employed to assess vascular integrity and thereby predict AVN (Chen et al., 2014; Kubisa et al., 2024). However, these parameters inadequately reflect the multifaceted and complex pathophysiological processes that contribute to the development and progression of AVN. Consequently, constructing risk models based on comprehensive clinical characteristics to predict AVN following talar fractures can assist clinicians in developing patient-specific management measures and represents a key strategy for AVN prevention.

Machine learning is a subset of artificial intelligence that focuses on the application of algorithms to analyze complex datasets and learn from previous experience, surpassing traditional methods in predicting clinical outcomes (Churpek et al., 2020; Haug and Drazen, 2023). Recent studies have demonstrated the widespread application of machine learning in the field of orthopaedics, such as in the early detection of implant failure and bone nonunion (Harris et al., 2018; Karnuta et al., 2021). However, its application in predicting complications following talar fractures remains underexplored. In addition, machine learning techniques are often considered “black-box” because explaining the decision-making process of the algorithm is complex and challenging (Fanizzi et al., 2024; Hu et al., 2024). The SHapley Additive exPlanations (SHAP), a component of Explainable Artificial Intelligence (XAI), provides transparent explanations of machine learning decisions and elucidates the rationale behind predictions (Wang et al., 2023), thereby addressing the “black-box” limitation by revealing the mechanisms underlying model decisions.

Therefore, this study aimed to develop and validate an explainable prediction model for AVN following talar fracture surgery by leveraging six advanced machine learning algorithms and integrating multidimensional data including patient clinical, radiographic, and operative variables. Subsequently, we evaluated model performance to identify the optimal algorithm and incorporated SHAP analysis to improve interpretability. Our study aimed to provide guidance for surgeons in implementing personalized prevention and treatment strategies by identifying high-risk patients with AVN, and ultimately reduce the morbidity associated with this devastating complication.

2 Methods and materials

2.1 Study population

This study enrolled patients with talar fractures who underwent surgical interventions at Ningbo No.6 Hospital between January 2018 and December 2023. Inclusion criteria: (1) patients diagnosed with fresh talar fractures (time from injury to surgery <3 weeks); (2) patients who underwent internal fixation; (3) age ≥18 years; (4) patients with complete clinical data and follow-up > 12 months. Exclusion criteria: (1) primary arthrodesis or amputation; (2) previous ankle or foot surgery; (3) severe foot neuropathy or vascular insufficiency; (4) patients with serious clinical or laboratory data missing; (5) incomplete follow-up information. The study was conducted in accordance with the Declaration of Helsinki and approved by the Institutional Review Board of Ningbo No.6 Hospital. The need for individual patient consent was waived by the Institutional Review Board due to the retrospective nature of the study and the use of anonymized data.

2.2 Data collection and processing

Baseline variables were selected based on clinical expertise and relevant literature. Clinical data were extracted from electronic medical records and categorized as follows: (1) demographic characteristics including gender, age, ASA class (American Society of Anesthesiologists physical status classification), BMI, hypertension, diabetes, heart disease, smoking, and drinking; (2) fracture-related variables including injury mechanism, fracture side, fracture classification (Hawkins classification for talar neck fractures and Sneppen classification for talar body fractures), fracture type, and concomitant ipsilateral foot and ankle fractures (Srinath et al., 2024); (3) surgery-related parameters including time to surgery, surgical strategy, fixation method, surgical approach, lateral malleolus osteotomy, medial malleolus osteotomy, intraoperative blood loss, operating duration, and quality of fracture reduction; (4) follow-up information including follow-up time and fixation removal. AVN was diagnosed based on radiographic criteria, including sclerosis, cystic changes, or talar collapse observed on postoperative imaging (plain radiographs, CT, or MRI) (Alley et al., 2024).

2.3 Factor screening

The dataset was randomly partitioned into training (70%) and testing (30%) sets. The training dataset was utilized to develop the prediction model, while the test dataset was reserved for independent validation. The variables in the training set were initially screened by univariate logistic regression analyses. Subsequently, the variables meeting the significance threshold (P < 0.05) were included in multivariate logistic regression analyses. Ultimately, the variables that demonstrated statistical significance in the multivariate logistic regression were incorporated into machine learning algorithms for prediction model construction (Du et al., 2025).
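The partition and the significance filter described above can be sketched in a few lines of standard-library Python. This is a minimal illustration, not the study's code: the seed, the choice to round the test set up, the helper names `split_indices` and `keep_significant`, and the placeholder P values are all assumptions introduced here. With 207 patients, rounding the test set up reproduces the reported 144/63 split.

```python
import math
import random


def split_indices(n_patients, test_fraction=0.3, seed=42):
    """Randomly partition patient indices into training and testing sets."""
    indices = list(range(n_patients))
    random.Random(seed).shuffle(indices)  # seed value is an arbitrary assumption
    n_test = math.ceil(n_patients * test_fraction)  # assumption: test set rounded up
    return sorted(indices[n_test:]), sorted(indices[:n_test])


def keep_significant(p_values, alpha=0.05):
    """Return the variables whose P value falls below the threshold."""
    return [name for name, p in p_values.items() if p < alpha]


train_idx, test_idx = split_indices(207)
print(len(train_idx), len(test_idx))  # 144 63

# Placeholder P values for illustration only; these are not study results.
placeholder_p = {"variable_a": 0.003, "variable_b": 0.21, "variable_c": 0.049}
print(keep_significant(placeholder_p))  # ['variable_a', 'variable_c']
```

In the two-stage screening, the same filter would be applied once to the univariate P values and again to the multivariate P values of the surviving variables.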

2.4 Model development and comparison

Six machine learning algorithms were employed in this study: Random Forest (RF), Naive Bayes (NB), Gradient Boosting Machine (GBM), K-Nearest Neighbors (KNN), Extra Trees (ET), and Extreme Gradient Boosting (XGBoost). Hyperparameters were optimized using grid search combined with manual fine-tuning (Supplementary Table S1). The training set was used to construct the prediction models, and the performance of the different algorithms was compared (Li et al., 2025).

Receiver Operating Characteristic (ROC) curves were utilized to evaluate the accuracy of each model, with the Area Under the Curve (AUC) serving as a performance metric. Additionally, Decision Curve Analysis (DCA) and calibration curves were plotted to assess the clinical applicability and calibration of the models. Additional performance metrics were evaluated, including accuracy, sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), precision, recall, and F1 score (Quan et al., 2024).
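To make two of these metrics concrete, the sketch below computes the AUC as the fraction of positive-negative pairs that the model ranks correctly, and the net benefit that a decision curve plots at a single threshold probability, TP/n − (FP/n) × pt/(1 − pt). The labels and scores are a small invented example, not study data, and the function names are hypothetical.

```python
def auc(labels, scores):
    """AUC as the probability that a random positive outranks a random negative."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    # Count correctly ordered pairs; ties contribute one half.
    wins = sum(1.0 if p > q else 0.5 if p == q else 0.0 for p in pos for q in neg)
    return wins / (len(pos) * len(neg))


def net_benefit(labels, scores, threshold):
    """Net benefit at one threshold probability, as plotted in a decision curve."""
    n = len(labels)
    tp = sum(1 for y, s in zip(labels, scores) if s >= threshold and y == 1)
    fp = sum(1 for y, s in zip(labels, scores) if s >= threshold and y == 0)
    return tp / n - fp / n * threshold / (1 - threshold)


# Invented example: 1 = AVN, 0 = no AVN, with predicted probabilities.
labels = [0, 0, 1, 0, 1, 1, 0, 1]
scores = [0.10, 0.35, 0.40, 0.20, 0.80, 0.65, 0.55, 0.90]

print(auc(labels, scores))               # 0.9375
print(net_benefit(labels, scores, 0.5))  # 0.25
```

A full decision curve repeats the net-benefit calculation over a range of thresholds and compares the model against the treat-all and treat-none strategies.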

2.5 Interpretation tools for the model

To address the “black-box” nature of machine learning models, the SHAP (v1.8.5) was implemented using KernelExplainer for model-agnostic interpretation. This approach ranks the importance of input features and provides explanations for model predictions. The SHAP offers both global and local explanations: global explanations provide consistent and accurate attribution values for each feature, indicating their contribution to the final prediction, while local explanations provide a tailored risk assessment for each patient by assessing the contribution of features to an individual prediction.
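The quantity that SHAP attributes to each feature is a Shapley value. KernelExplainer estimates it by weighted regression over sampled feature coalitions with a background dataset; for a handful of features it can instead be computed exactly by enumerating every coalition, which is what the sketch below does. The risk function `toy_risk`, its coefficients, the single baseline patient, and the feature names are hypothetical stand-ins, not the fitted XGBoost model.

```python
from itertools import combinations
from math import factorial


def shapley_values(predict, instance, baseline):
    """Exact Shapley values; features outside a coalition take baseline values."""
    names = list(instance)
    n = len(names)
    values = {}
    for target in names:
        others = [f for f in names if f != target]
        total = 0.0
        for size in range(n):
            # Shapley weight for coalitions of this size.
            weight = factorial(size) * factorial(n - size - 1) / factorial(n)
            for coalition in combinations(others, size):
                without = {f: instance[f] if f in coalition else baseline[f] for f in names}
                with_target = dict(without, **{target: instance[target]})
                total += weight * (predict(with_target) - predict(without))
        values[target] = total
    return values


def toy_risk(x):
    # Hypothetical risk score with one interaction term; not the study model.
    return (0.10 + 0.20 * x["smoking"] + 0.01 * (x["bmi"] - 22)
            + 0.05 * x["smoking"] * x["poor_reduction"])


patient = {"smoking": 1, "bmi": 30, "poor_reduction": 1}
baseline = {"smoking": 0, "bmi": 22, "poor_reduction": 0}

phi = shapley_values(toy_risk, patient, baseline)
print(phi)
# Local explanation: the contributions sum to the gap between this patient's
# prediction and the baseline prediction.
print(sum(phi.values()), toy_risk(patient) - toy_risk(baseline))
```

A global ranking of the kind shown in a SHAP summary plot is obtained by averaging the absolute values of these per-patient contributions across the cohort.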

2.6 Statistical analysis

Statistical analysis was performed using Python version 3.11.4, and a significant difference was set as P < 0.05. Continuous variables were analyzed using Student’s t-test or Mann-Whitney U test, while categorical variables were assessed using the chi-square test or Fisher’s exact test, depending on the data distribution.
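As an illustration of the categorical comparison, the sketch below runs a Pearson chi-square test on a 2 × 2 table using only the standard library. The table compares AVN frequency between the training and testing sets, with counts derived from the confusion matrices described for Figure 3 (31 of 144 and 14 of 63); the paper does not report this particular test, and omitting the continuity correction is an assumption made here.

```python
from math import erfc, sqrt


def chi_square_2x2(table):
    """Pearson chi-square test (no continuity correction) for a 2x2 table."""
    (a, b), (c, d) = table
    n = a + b + c + d
    row_totals = (a + b, c + d)
    col_totals = (a + c, b + d)
    statistic = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            expected = row_totals[i] * col_totals[j] / n
            statistic += (observed - expected) ** 2 / expected
    # Upper-tail probability of a chi-square variable with 1 degree of freedom.
    p_value = erfc(sqrt(statistic / 2))
    return statistic, p_value


# Rows: training set, testing set. Columns: AVN, no AVN.
avn_by_set = [[31, 113], [14, 49]]
statistic, p_value = chi_square_2x2(avn_by_set)
print(round(statistic, 4), round(p_value, 2))
```

With these counts the P value is far above 0.05, which agrees with the statement that the two sets were comparable; Fisher's exact test would be the usual alternative when expected cell counts are small.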

3 Results

3.1 Patient characteristics

A total of 207 patients undergoing surgical intervention for talar fractures were enrolled, while 165 patients were excluded based on the inclusion and exclusion criteria (Supplementary Figure S1). Complete case analysis was performed, and no imputation or data augmentation was applied. The baseline characteristics of the included patients are summarized in Table 1. Among these patients, 45 (21.74%) developed AVN following talar fractures, and 162 (78.26%) did not. Patients were randomly allocated to a training set (n = 144, 70%) and a test set (n = 63, 30%). Baseline characteristics were comparable between the training and test sets, with no statistically significant differences (Table 2).

Table 1. Comparison of baseline characteristics between training and testing sets.

Table 2. Comparison of baseline characteristics between the Non-AVN and AVN groups.

3.2 Univariate and multivariable logistic regression

Univariate logistic regression analysis identified several variables significantly associated with the development of AVN following talar fractures, including operating duration, intraoperative blood loss, BMI, fracture classification, surgical approach, medial malleolus osteotomy, fixation removal, concomitant ipsilateral foot and ankle fractures, smoking, surgical strategy, quality of fracture reduction, and fracture type. Multivariable logistic regression further confirmed six independent risk factors: BMI, fracture classification, concomitant ipsilateral foot and ankle fractures, smoking, quality of fracture reduction, and fracture type (Table 3).

Table 3. Univariate logistic regression analysis and multivariate logistic regression analysis.

3.3 Model building and performance evaluation

Using the six independent risk factors identified by multivariable logistic regression, we constructed six machine learning models in the training set. Predictive performance was evaluated using five-fold cross-validation and assessed with metrics including AUC, calibration curves, and DCA. The AUC results showed that all models had strong predictive performance, with XGBoost achieving the highest AUC in both the training and testing sets (Figures 1A,B). Additionally, XGBoost showed the best performance on the DCA and calibration curves, indicating superior clinical applicability and calibration (Figures 1C–F). To comprehensively evaluate model performance, we calculated additional metrics, including accuracy, sensitivity, specificity, PPV, NPV, precision, recall, and F1 score, for all six machine learning models in both the training and testing sets (Figures 2A,B). Based on the combined evaluation of all metrics, XGBoost was the most accurate and reliable model for predicting AVN following talar fractures.
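Five-fold cross-validation on the training set can be pictured as follows. The sketch only generates the index splits; fitting and scoring a model on each split is left out. The seed, the absence of stratification by AVN status, and the helper name `k_fold_indices` are assumptions, since the paper does not describe how the folds were formed.

```python
import random


def k_fold_indices(n_samples, k=5, seed=0):
    """Yield (training, validation) index lists for k-fold cross-validation."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)  # seed value is an arbitrary assumption
    folds = [indices[i::k] for i in range(k)]  # k near-equal folds
    for i in range(k):
        validation = folds[i]
        training = [idx for j, fold in enumerate(folds) if j != i for idx in fold]
        yield training, validation


# 144 is the size of the training set reported in the paper.
splits = list(k_fold_indices(144))
fold_sizes = [len(validation) for _, validation in splits]
print(fold_sizes)  # [29, 29, 29, 29, 28]
```

Each patient in the training set is used for validation exactly once, and the reported score for a model is typically the mean of the five validation scores.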

Figure 1

Panel A and B show ROC curves for various models, comparing sensitivity to 1-specificity in training and testing sets, respectively. Panels C and D feature net benefit curves across threshold probabilities for both datasets. Panels E and F present calibration plots illustrating predicted versus observed probabilities for training and testing sets. Different models—RandomForest, NaiveBayes, KNN, ExtraTrees, GradientBoosting, XGBoost—are represented with distinct colored lines.

Figure 1. The comprehensive analysis of six machine learning models. (A) The ROC curve of the training set. (B) The ROC curve of the testing set. (C) The DCA curve of the training set. (D) The DCA curve of the testing set. (E) The calibration curve of the training set. (F) The calibration curve of the testing set.

Figure 2

Heatmaps depict the performance of various machine learning models on training and testing datasets. Panel A shows training set performance, and panel B shows testing set performance for models like RandomForest, NaiveBayes, KNN, ExtraTrees, GradientBoosting, and XGBoost. Metrics include AUC, Accuracy, Sensitivity, Specificity, PPV, NPV, Precision, Recall, and F1-Score, with values ranging from 0.55 to 0.95, indicated by a color scale.

Figure 2. Performance indicators of six machine learning models in both the training and testing sets. (A) The training set. (B) The testing set.

Waterfall charts showed that the XGBoost model had strong predictive performance in both the training and testing sets (Figures 3A,B). Additionally, confusion matrices were constructed to evaluate the model's performance and transparency in predicting AVN (Figures 3C,D). The results revealed that the XGBoost model achieved excellent predictive accuracy, with high sensitivity and specificity in both datasets.

Figure 3

Four panels show performance graphs and confusion matrices for training and testing sets. Panel A: Bar chart of training set predictions with blue bars for label 0 and orange bars for label 1. Panel B: Similar bar chart for testing set. Panel C: Confusion matrix for training set showing 106 true negatives, 7 false positives, 2 false negatives, and 29 true positives. Panel D: Confusion matrix for testing set showing 42 true negatives, 7 false positives, 1 false negative, and 13 true positives. Color gradients indicate value intensities.

Figure 3. Waterfall chart and confusion matrix of XGBoost model. (A) Waterfall chart of the training set. (B) Waterfall chart of the testing set. (C) Confusion matrix of the training set. (D) Confusion matrix of the testing set.
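The threshold-based metrics follow directly from the four cells of each confusion matrix. The sketch below applies the standard definitions to the counts given in the Figure 3 description; the resulting values are arithmetic on those counts rather than numbers read from Figure 2, and the function name is hypothetical.

```python
def classification_metrics(tn, fp, fn, tp):
    """Standard metrics from the four cells of a binary confusion matrix."""
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    ppv = tp / (tp + fp)
    npv = tn / (tn + fn)
    return {
        "accuracy": (tp + tn) / (tp + tn + fp + fn),
        "sensitivity": sensitivity,  # identical to recall
        "specificity": specificity,
        "ppv": ppv,                  # identical to precision
        "npv": npv,
        "f1": 2 * ppv * sensitivity / (ppv + sensitivity),
    }


# Counts as given in the description of Figures 3C and 3D.
training = classification_metrics(tn=106, fp=7, fn=2, tp=29)
testing = classification_metrics(tn=42, fp=7, fn=1, tp=13)

for name, metrics in (("training", training), ("testing", testing)):
    print(name, {key: round(value, 3) for key, value in metrics.items()})
```

On these counts, sensitivity is about 0.94 in training and 0.93 in testing, while PPV drops from about 0.81 to 0.65, reflecting the seven false positives among only 20 predicted positives in the smaller testing set.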

3.4 Model explanation

To enhance clinical interpretability, we utilized the SHAP method to explain the final XGBoost model. This approach provided two types of explanations: global explanations of the model at the feature level and local explanations at the individual level. Global explanations, illustrated in the SHAP summary plot, ranked the features based on their contribution to the model using the SHAP mean values. Smoking, BMI, and concomitant ipsilateral foot and ankle fractures were identified as the three most important predictors of AVN (Figure 4A). In addition, the SHAP dependence plot illustrated the influence of individual features on model predictions, with red representing high risk values and blue representing low risk values (Figure 4B). In SHAP analysis, positive SHAP values for features such as smoking and BMI indicate an elevated risk of AVN, whereas negative values suggest a protective effect. For instance, higher BMI values are associated with increased AVN risk, attributable to greater mechanical stress and metabolic disturbances. Conversely, certain fracture classifications with negative SHAP values correlate with reduced AVN risk, likely reflecting lesser fracture displacement and diminished vascular compromise.

Figure 4

Four panels illustrating SHAP analysis for predictors of a medical outcome. Panel A shows a bar chart of mean SHAP values with factors like fracture classification and smoking listed. Panel B is a scatter plot showing SHAP values' impact on model output, with features like ipsilateral foot fractures. Panel C is a decision plot, highlighting features such as fracture classification with corresponding base values. Panel D presents a force plot with feature contributions for variables including smoking and BMI displayed as colored bars.

Figure 4. Interpretation of XGBoost model using the SHAP. (A) Importance ranking of features displayed by the SHAP. (B) Characterization attributes in the SHAP. (C) Examples of explicable outcomes of a patient suffering from AVN following talar fractures. (D) The SHAP values of a patient suffering from AVN following talar fractures.

For local explanations, we analyzed specific patients to understand how their individual characteristics contributed to the prediction of AVN. Figures 4C,D illustrate the SHAP decision plot and force plot for a patient who developed AVN. Red features indicate a facilitating effect on the occurrence of AVN, whereas blue features indicate an inhibitory effect; the length of each arrow represents the magnitude of the feature's contribution.

4 Discussion

This study successfully developed and validated a prediction model for AVN following talar fractures by applying machine learning. We identified BMI, fracture classification, concomitant ipsilateral foot and ankle fractures, smoking, quality of fracture reduction, and fracture type as key risk factors for AVN. The XGBoost model demonstrated robust discriminatory and calibration capabilities, providing valuable clinical guidance and highlighting the potential of machine learning for predicting orthopedic postoperative complications.

Our findings underscore the critical role of smoking as the most influential predictor in the model, attributable to its detrimental effects on vascular endothelial function and local blood supply (Patel et al., 2013). Smoking induces vasospasm, thrombosis, and microcirculatory disturbances, reducing blood supply and increasing the risk of AVN following talar fractures (Kondo et al., 2019). The result was consistent with previous studies and emphasized the critical importance of preoperative smoking cessation, especially for talar fracture patients.

Patients with high BMI often exhibit obesity-related metabolic dysregulation and chronic inflammation, potentially disrupting the microenvironment necessary for fracture healing. Fang et al. reported that the incidence of hyperlipidemia is significantly higher in patients with high BMI, and hyperlipidemia increases the risk of AVN by forming fat plugs that hinder neovascularization (Pei et al., 2020). In addition, elevated BMI may increase local mechanical stress on the talus, potentially raising the risk of fracture displacement (Collins et al., 2018). This finding highlights the need for comprehensive preoperative assessment and targeted weight management strategies in patients with high BMI to optimize outcomes and minimize the risk of AVN.

Concomitant ipsilateral foot and ankle fractures, typically indicative of higher-energy trauma, further compromise the talus’s blood supply and surrounding soft tissues (Srinath et al., 2024). In addition, ipsilateral foot and ankle fractures may limit postoperative rehabilitation activities, indirectly impairing blood circulation. Zhang et al. reported that inflammatory markers and osteoclast activity were elevated in multiple fractures compared with single fractures (Zhang et al., 2021). Furthermore, Zheng et al. suggested that the chronic inflammatory microenvironment regulated by bone immune abnormalities may contribute significantly to AVN pathogenesis (Zheng et al., 2022). Our findings emphasized the importance of recognizing and managing comorbid injuries in patients with talar fractures, as they necessitate tailored surgical and rehabilitation protocols to mitigate the risk of AVN.

Fracture type and classification reflect injury severity and anatomical disruption (Jordan et al., 2017). Our results demonstrate a higher incidence of AVN in patients with combined talar neck and body fractures, potentially enhancing the prognostic utility of the Hawkins and Sneppen classification systems (Vallier et al., 2014; Mechas et al., 2023). Additionally, the quality of fracture reduction emerged as a significant predictor of AVN, as anatomical reduction maximizes the restoration of blood circulation around the talus. We defined poor reduction as >2 mm displacement or >5° neck angulation, consistent with Biz et al. (2019). This underscores the critical importance of meticulous surgical technique and appropriate fixation to achieve and maintain optimal reduction, thereby reducing the risk of AVN.

The application of machine learning in this study demonstrates its tremendous capabilities in orthopaedics. By leveraging multidimensional clinical data, machine learning models can automatically identify complex data patterns and provide personalized predictions, offering advantages over traditional statistical methods in handling nonlinear relationships and high-dimensional data. Despite promising results, this study has certain limitations. As a retrospective study, the reliance on medical records may introduce limitations in data quality and reduce the credibility of the evidence. Additionally, the performance of machine learning models is contingent on the diversity and representativeness of the training data, and our single-center study design may limit model generalizability. Performance may vary across institutions due to surgical technique heterogeneity or demographic differences. Therefore, future multicenter prospective studies with larger samples are warranted to enhance generalizability and clinical applicability. Furthermore, self-reported factors like smoking and alcohol use are susceptible to reporting bias. BMI and smoking may proxy unmeasured confounders such as hyperlipidemia or sedentary behavior. Although SHAP quantifies feature contributions, residual confounding could bias interpretations. Hence, incorporating serological markers such as lipid profiles and objective lifestyle measures with rigorous follow-up protocols would further validate the reliability of the model.

5 Conclusion

In conclusion, our study developed a novel predictive framework for AVN following talar fractures, leveraging machine learning to identify key risk factors and assess their contributions to the development of this complication. The findings advance our understanding of the pathophysiology of AVN and offer practical insights for clinicians to optimize surgical planning and postoperative management. However, because the present study lacks external validation, future multicenter validation and refinement are warranted to ensure broader clinical applicability and effectiveness.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

The studies involving humans were approved by the Institutional Review Board of Ningbo No.6 Hospital. The studies were conducted in accordance with the local legislation and institutional requirements. The ethics committee/institutional review board waived the requirement of written informed consent for participation from the participants or the participants' legal guardians/next of kin because of the retrospective nature of the study and the use of anonymized data.

Author contributions

JZ: Conceptualization, Data curation, Formal Analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Software, Supervision, Validation, Visualization, Writing – original draft. JX: Conceptualization, Data curation, Formal Analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Software, Supervision, Validation, Visualization, Writing – original draft. JY: Data curation, Methodology, Writing – original draft. HC: Writing – review and editing. XH: Writing – review and editing. SZ: Conceptualization, Supervision, Writing – review and editing. XW: Funding acquisition, Project administration, Supervision, Validation, Writing – review and editing. CS: Funding acquisition, Project administration, Supervision, Writing – review and editing.

Funding

The author(s) declare that financial support was received for the research and/or publication of this article. This study was supported by the Science and Technology Project in Yinzhou District, Ningbo City, Zhejiang Province (2025AS032), the Ningbo Medical Science and Technology Plan Project (2024Y524), and the Ningbo Clinical Research Center for Orthopedics, Sports Medicine and Rehabilitation (2024L004).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Generative AI was used in the creation of this manuscript.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fbioe.2025.1644261/full#supplementary-material

References

Alley, M. C., Vallier, H. A., and Tornetta, P. (2024). Identifying risk factors for osteonecrosis after talar fracture. J. Orthop. Trauma 38 (1), 25–30. doi:10.1097/bot.0000000000002706

Biz, C., Golin, N., De Cicco, M., Maschio, N., Fantoni, I., Frizziero, A., et al. (2019). Long-term radiographic and clinical-functional outcomes of isolated, displaced, closed talar neck and body fractures treated by ORIF: the timing of surgical management. BMC Musculoskelet. Disord. 20 (1), 363. doi:10.1186/s12891-019-2738-2

Chen, H., Liu, W., Deng, L., and Song, W. (2014). The prognostic value of the Hawkins sign and diagnostic value of MRI after talar neck fractures. Foot Ankle Int. 35 (12), 1255–1261. doi:10.1177/1071100714547219

Choi, J. Y., Kim, H. S., Ngissah, R., and Suh, J. S. (2023). Operative outcomes of a high-grade talar neck fracture - lessons from 20 years' clinical experience in a single, tertiary hospital. Foot Ankle Surg. 29 (2), 118–127. doi:10.1016/j.fas.2022.12.002

Churpek, M. M., Carey, K. A., Edelson, D. P., Singh, T., Astor, B. C., Gilbert, E. R., et al. (2020). Internal and external validation of a machine learning risk score for acute kidney injury. JAMA Netw. Open 3 (8), e2012892. doi:10.1001/jamanetworkopen.2020.12892

Collins, A. T., Kulvaranon, M. L., Cutcliffe, H. C., Utturkar, G. M., Smith, W. A. R., Spritzer, C. E., et al. (2018). Obesity alters the in vivo mechanical response and biochemical properties of cartilage as measured by MRI. Arthritis Res. Ther. 20 (1), 232. doi:10.1186/s13075-018-1727-4

Dodd, A., and Lefaivre, K. A. (2015). Outcomes of talar neck fractures: a systematic review and meta-analysis. J. Orthop. Trauma 29 (5), 210–215. doi:10.1097/bot.0000000000000297

Du, S., Wu, Y., Tao, J., Shu, L., Yan, T., Xiao, B., et al. (2025). Development and validation of machine learning models for outcome prediction in patients with poor-grade aneurysmal subarachnoid hemorrhage following endovascular treatment. Ther. Clin. Risk Manag. 21, 293–307. doi:10.2147/tcrm.S504745

Fanizzi, A., Arezzo, F., Cormio, G., Comes, M. C., Cazzato, G., Boldrini, L., et al. (2024). An explainable machine learning model to solid adnexal masses diagnosis based on clinical data and qualitative ultrasound indicators. Cancer Med. 13 (12), e7425. doi:10.1002/cam4.7425

Harris, A. H., Kuo, A. C., Bowe, T., Gupta, S., Nordin, D., and Giori, N. J. (2018). Prediction models for 30-day mortality and complications after total knee and hip arthroplasties for Veteran Health Administration patients with osteoarthritis. J. Arthroplasty 33 (5), 1539–1545. doi:10.1016/j.arth.2017.12.003

Haug, C. J., and Drazen, J. M. (2023). Artificial intelligence and machine learning in clinical medicine, 2023. N. Engl. J. Med. 388 (13), 1201–1208. doi:10.1056/NEJMra2302038

Hu, J., Xu, J., Li, M., Jiang, Z., Mao, J., Feng, L., et al. (2024). Identification and validation of an explainable prediction model of acute kidney injury with prognostic implications in critically ill children: a prospective multicenter cohort study. EClinicalMedicine 68, 102409. doi:10.1016/j.eclinm.2023.102409

Jordan, R. K., Bafna, K. R., Liu, J., and Ebraheim, N. A. (2017). Complications of talar neck fractures by Hawkins classification: a systematic review. J. Foot Ankle Surg. 56 (4), 817–821. doi:10.1053/j.jfas.2017.04.013

Karnuta, J. M., Haeberle, H. S., Luu, B. C., Roth, A. L., Molloy, R. M., Nystrom, L. M., et al. (2021). Artificial intelligence to identify arthroplasty implants from radiographs of the hip. J. Arthroplasty 36 (7s), S290–S294.e1. doi:10.1016/j.arth.2020.11.015

Kondo, T., Nakano, Y., Adachi, S., and Murohara, T. (2019). Effects of tobacco smoking on cardiovascular disease. Circ. J. 83 (10), 1980–1985. doi:10.1253/circj.CJ-19-0323

Kubisa, M. J., Kubisa, M. G., Pałka, K., Sobczyk, J., Bubieńczyk, F., and Łęgosz, P. (2024). Avascular necrosis of the talus: diagnosis, treatment, and modern reconstructive options. Medicina (Kaunas) 60 (10), 1692. doi:10.3390/medicina60101692

Li, L., Yang, X., Guo, W., Wu, W., Guo, M., Li, H., et al. (2025). Predicting the risk of postoperative gastrointestinal bleeding in patients with type A aortic dissection based on an interpretable machine learning model. Front. Med. (Lausanne) 12, 1554579. doi:10.3389/fmed.2025.1554579

Mechas, C. A., Aneja, A., Nazal, M. R., Pectol, R. W., Sneed, C. R., Foster, J. A., et al. (2023). Association of talar neck fractures with body extension and risk of avascular necrosis. Foot Ankle Int. 44 (5), 392–400. doi:10.1177/10711007231160751

Patel, R. A., Wilson, R. F., Patel, P. A., and Palmer, R. M. (2013). The effect of smoking on bone healing: a systematic review. Bone Jt. Res. 2 (6), 102–111. doi:10.1302/2046-3758.26.2000142

Pei, F., Zhao, R., Li, F., Chen, X., Guo, K., and Zhu, L. (2020). Osteonecrosis of femoral head in young patients with femoral neck fracture: a retrospective study of 250 patients followed for average of 7.5 years. J. Orthop. Surg. Res. 15 (1), 238. doi:10.1186/s13018-020-01724-4

Quan, K. R., Lin, W. R., Hong, J. B., Lin, Y. H., Chen, K. Q., Chen, J. H., et al. (2024). A machine learning approach for predicting radiation-induced hypothyroidism in patients with nasopharyngeal carcinoma undergoing tomotherapy. Sci. Rep. 14 (1), 8436. doi:10.1038/s41598-024-59249-3

Saravi, B., Lang, G., Ruff, R., Schmal, H., Südkamp, N., Ülkümen, S., et al. (2021). Conservative and surgical treatment of talar fractures: a systematic review and meta-analysis on clinical outcomes and complications. Int. J. Environ. Res. Public Health 18 (16), 8274. doi:10.3390/ijerph18168274

Srinath, A., Southall, W. G. S., Nazal, M. R., Mechas, C. A., Foster, J. A., Griffin, J. T., et al. (2024). Talar neck fractures with associated ipsilateral foot and ankle fractures have a higher risk of avascular necrosis. J. Orthop. Trauma 38 (6), 220–224. doi:10.1097/bot.0000000000002798

Vallier, H. A., Reichard, S. G., Boyd, A. J., and Moore, T. A. (2014). A new look at the Hawkins classification for talar neck fractures: which features of injury and treatment are predictive of osteonecrosis? J. Bone Jt. Surg. Am. 96 (3), 192–197. doi:10.2106/jbjs.L.01680

Wang, Z., Sun, Z., Yu, L., Wang, Z., Li, L., and Lu, X. (2023). Machine learning-based prediction of composite risk of cardiovascular events in patients with stable angina pectoris combined with coronary heart disease: development and validation of a clinical prediction model for Chinese patients. Front. Pharmacol. 14, 1334439. doi:10.3389/fphar.2023.1334439

Zhang, C., Zhu, J., Jia, J., Guan, Z., Sun, T., Zhang, W., et al. (2021). Effect of single versus multiple fractures on systemic bone loss in mice. J. Bone Min. Res. 36 (3), 567–578. doi:10.1002/jbmr.4211

Zheng, J., Yao, Z., Xue, L., Wang, D., and Tan, Z. (2022). The role of immune cells in modulating chronic inflammation and osteonecrosis. Front. Immunol. 13, 1064245. doi:10.3389/fimmu.2022.1064245

Keywords: machine learning, risk factors, prediction model, avascular necrosis, talar fractures

Citation: Zhang J, Xu J, Yu J, Chen H, Hong X, Zhang S, Wang X and Shen C (2025) Predicting the risk of postoperative avascular necrosis in patients with talar fractures based on an interpretable machine learning model. Front. Bioeng. Biotechnol. 13:1644261. doi: 10.3389/fbioe.2025.1644261

Received: 10 June 2025; Accepted: 18 July 2025; Published: 31 July 2025.

Edited by:

Wencai Liu, Shanghai Jiao Tong University, China

Reviewed by:

YiPing Luo, Tongji University School of Medicine, China
Kai Liu, First Affiliated Hospital of Xinjiang Medical University, China
Cameron Sabet, Georgetown University Medical Center, United States

Copyright © 2025 Zhang, Xu, Yu, Chen, Hong, Zhang, Wang and Shen. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Xin Wang, dr.wangxin@hotmail.com; Songou Zhang, zso010@163.com; Chengchun Shen, scc1207@sina.com

†These authors share first authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.