Deep Learning Improves Osteonecrosis Prediction of Femoral Head After Internal Fixation Using Hybrid Patient and Radiograph Variables

Femoral neck fractures (FNFs) are a great public health problem that leads to a high incidence of death and dysfunction. Osteonecrosis of the femoral head (ONFH) after internal fixation of FNF is a frequently reported complication and a major cause for reoperation. Early intervention can prevent osteonecrosis aggravation at the preliminary stage. However, at present, failure to diagnose asymptomatic ONFH after FNF fixation hinders effective intervention at early stages. The primary objective of this study was to develop a predictive model for postoperative ONFH using deep learning (DL) methods developed using plain X-ray radiographs and hybrid patient variables. A two-center retrospective study of patients who underwent closed reduction and cannulated screw fixation was performed. We trained a convolutional neural network (CNN) model using postoperative pelvic radiographs and the output regressive radiograph variables. A less experienced orthopedic doctor, and an experienced orthopedic doctor also evaluated and diagnosed the patients using postoperative pelvic radiographs. Hybrid nomograms were developed based on patient and radiograph variables to determine predictive performance. A total of 238 patients, including 95 ONFH patients and 143 non-ONFH patients, were included. A CNN model was trained using postoperative radiographs and output radiograph variables. The accuracy of the validation set was 0.873 for the CNN model, and the algorithm achieved an area under the curve (AUC) value of 0.912 for the prediction. The diagnostic and predictive ability of the algorithm was superior to that of the two doctors, based on the postoperative X-rays. The addition of DL-based radiograph variables to the clinical nomogram improved predictive performance, resulting in an AUC of 0.948 (95% CI, 0.920–0.976) and better calibration. The decision curve analysis showed that adding the DL increased the clinical usefulness of the nomogram compared with a clinical approach alone. In conclusion, we constructed a DL facilitated nomogram that incorporated a hybrid of radiograph and patient variables, which can be used to improve the prediction of preoperative osteonecrosis of the femoral head after internal fixation.

Femoral neck fractures (FNFs) are a great public health problem that leads to a high incidence of death and dysfunction. Osteonecrosis of the femoral head (ONFH) after internal fixation of FNF is a frequently reported complication and a major cause for reoperation. Early intervention can prevent osteonecrosis aggravation at the preliminary stage. However, at present, failure to diagnose asymptomatic ONFH after FNF fixation hinders effective intervention at early stages. The primary objective of this study was to develop a predictive model for postoperative ONFH using deep learning (DL) methods developed using plain X-ray radiographs and hybrid patient variables. A two-center retrospective study of patients who underwent closed reduction and cannulated screw fixation was performed. We trained a convolutional neural network (CNN) model using postoperative pelvic radiographs and the output regressive radiograph variables. A less experienced orthopedic doctor, and an experienced orthopedic doctor also evaluated and diagnosed the patients using postoperative pelvic radiographs. Hybrid nomograms were developed based on patient and radiograph variables to determine predictive performance. A total of 238 patients, including 95 ONFH patients and 143 non-ONFH patients, were included. A CNN model was trained using postoperative radiographs and output radiograph variables. The accuracy of the validation set was 0.873 for the CNN model, and the algorithm achieved an area under the curve (AUC) value of 0.912 for the prediction. The diagnostic and predictive ability of the algorithm was superior to that of the two doctors, based on the postoperative X-rays. The addition of DL-based radiograph variables to the clinical nomogram improved predictive performance, resulting in an AUC of 0.948 (95% CI, 0.920-0.976) and better calibration. The decision curve analysis showed that adding the DL increased the clinical usefulness of the nomogram compared with a clinical approach alone. In conclusion, we constructed a DL facilitated nomogram that incorporated a hybrid of radiograph and patient variables, which can be used to improve the prediction of preoperative osteonecrosis of the femoral head after internal fixation.

INTRODUCTION
Hip fracture is a significant public health concern that affects 4.5 million people worldwide each year and this number is expected to increase to 21 million in the next 40 years (1, 2). Femoral neck fracture (FNF) is one of the most common types of hip fracture, accounting for 49-80% of all hip fractures (3,4). Despite the availability of multiple effective internal fixation procedures, ∼10-48.8% femoral neck fractures require reoperation (5)(6)(7). Osteonecrosis of the femoral head (ONFH) is a major cause of reoperation for FNF (8). Joint disfunction, pain, disability, and mental anguish caused by ONFH result in great suffering for patients (9)(10)(11). End-stage ONFH often inevitably requires artificial joint replacement surgery, an invasive and economically costly technique. Early diagnosis can facilitate the application of interventions that can avoid or delay arthroplasty to a certain extent (12)(13)(14). However, misdiagnoses and delayed diagnoses are common due to the lack of preliminary symptoms, typical features, and internal fixation interference on radiographs (14). Different diagnostic criteria or simple visual estimates are used by radiologists for practical imaging diagnosis, resulting in unsatisfactory levels of diagnostic consistency and accuracy (15). Therefore, early accurate and consistent prediction of ONFH in patients after FNF internal fixation may hold the key for improving patient outcomes.
Deep learning (DL) using radiographs has a proven ability of classifying bone structures and features in specific sites with expert-level accuracy (16,17). Convolutional Neural Networks (CNNs) are the most suitable models for image recognition of DL, and have been widely used for the orthopedic diagnosis of wrists and ankles (18,19). Gale et al. developed a hip fracture detector using DL and achieved an AUC of 0.994 (20). Cheng et al. reported on a deep convolutional neural network (DCNN) for the detection and localization of hip fractures using pelvic radiographs, which achieved an AUC of 0.98 for the identification of hip fractures (21). Recently, Chee et al. made a breakthrough discovery for the diagnosis of early ONFH using radiography through deep learning (22). This model achieved an AUC of 0.93 and sensitivity and specificity that were not inferior to the diagnosis made by both the less experienced and experienced radiologists. Their study indicated the potential of DL for the diagnosis and prediction of ONFH, especially for X-ray imaging. However, the implementation of DL for the diagnosis of postoperative ONFH using digital radiography remains unexplored. Postoperative X-rays are highly affected by interference, such as that of internal fixation devices, which cause difference between the images on radiographs and the original appearance of the femoral neck and femoral head. Since postoperative X-rays are the most common method used for early examination, a consistent diagnosis based on postoperative Xrays made using DL may improve the prediction of postoperative ONFH for better prognosis. In this study, we designed and assessed the diagnostic performance of a DL algorithm based on the CNN network model using postoperative X-rays. We also compared the accuracy of the diagnosis of postoperative ONFH between this DL model and assessments made by two orthopedic doctors of different levels of experience.
In previous studies, a large number of research studies have indicated that patient and interventional variables, including demography, fracture classification, laboratory examination, reduction quality, and initial postoperative rehabilitation, are significantly associated with postoperative ONFH (23)(24)(25)(26). However, intraoperative, and postoperative factors, especially radiographic variables, including intraoperative reduction and fracture healing, have yet to be incorporated into routine clinical postoperative ONFH prediction. In this study, a DL facilitated predictive model using a hybrid of patient and artificial intelligence (AI) radiographic variables, was also developed. Comparisons were made with a single clinical prediction model was performed to estimate whether DL could improve the prediction of postoperative ONFH.

Study Population
Data were obtained from two urban tertiary hospitals, The First Affiliated Hospital of University of Science and Technology of China (FAH) and the Southern Branch of the First Affiliated Hospital of University of Science and Technology of China (SBH). One hundred thirty-nine FAH patients and 99 SBH patients who had received closed reduction and cannulated screw fixation from June 2013 to January 2015 were enrolled in this study. The patient inclusion criteria were as follows: (i) Patients over 18 years of age with fresh FNFs; (ii) Postoperative pelvic radiographs obtained 6 months after surgery; (iii) Continuous follow-up for a minimum of 5 years with the clinical characteristics available. The exclusion criteria were as follows: (i) Pathological fractures and bilateral fractures; (ii) Long-term hormone use. The treatment standard and strategy used for femoral neck fracture was the cannulated compression screws fixation technique, based on American Academy of Orthopedic Surgeons guidelines (27). Postoperative ONFH was diagnosed using pelvic MRIs or co-diagnosis by three experienced orthopedic surgeons based on the pelvic radiograph obtained at the last follow-up. This study was approved by the Ethics Committees of both hospitals. Exemption of the informed consent, the information disclosure, and a negative opportunity are guaranteed in the Ethical approval (20-P-049).
Demographics, comorbidities, smoking status, alcohol use, blood tests, preoperative Garden classification, Pauwels angle, preoperative interval from injury, operation associated data, postoperative Garden index, preoperative interval to weight bearing and other baseline patient and clinical data were derived from medical and follow-up records. The data were de-identified after patient variables were collected.

Imaging Studies
Image acquisition and retrieval procedures were conducted using Picture Archiving and Communication Systems (PACS) on FAH and SBH patients. Digital radiographs of the hip were obtained using Digital Diagnostics (Philips Healthcare) on FAH patients and Discovery XR656 (GE Healthcare) on SBH patients. The size of the stored images varied from 2,128 × 2,248 pixels to 2,688 × 2,688 pixels, with 8-bit grayscale color. Each radiograph was labeled based on the final diagnosis of postoperative ONFH. Geometric, smooth, concave, bandlike low-signal intensity lesions at the femoral head on the T1weighted images were regarded as pathognomonic MRI findings of ONFH. For MRI data not obtained at the last follow-up (45/238, 18.9%), diagnosis was based on pelvic plain radiographs obtained at the last follow-up and was set as a reference for labeling. The Association Research Circulation Osseous (ARCO) classification system was used as the diagnostic standard for ONFH (28).
Radiographic image files were loaded for processing using a MATLAB library (version 2017b, MathWorks, USA). The 7 × 7 cm images centered on the bilateral femoral heads were cropped. The center coordinates were manually recorded in advance. Radiographs were standardized to a common size and pixel intensity distribution. The images were down-sampled and padded to a final size of 120 × 120 pixels. Mean pixel intensity and standard deviation of each image was normalized.

Algorithm Development and Extraction of Image Variables
For the development of a deep learning algorithm, we used MATLAB (version 2017b, MathWorks, USA) to implement a CNN model to compute abstract image features from input image pixel arrays. The design of the CNN model is shown in Table 1. The CNN model consisted of three convolutional blocks, a dropout and full connection layers. Each convolutional block comprised of convolutional operation, batch normalization, relu, and average pooling. The input used was Pixel values were set at 120 * 120 using a digital image. Cubic convolution and pooling were performed on each layer to adjust the weights of the neural network, using the difference between the output and true labels.
The patients in the dataset were assigned to different groups as follows: 149 (63%) for training, 17 (7%) for validation and 72 (30%) for testing. The output results underwent regression analysis. The network output was a probability distribution for the continuous variables of the regression coefficient from 0 to 1.25, which was divided at 0.25 intervals into classified labels, 1-5. Higher label values were more likely to be considered to more strongly predict postoperative ONFH. In this study, this output label was referred to as the AI index classification.

Algorithm Evaluation
Seventy-two independent datasets were used to test the trained predictive model to evaluate its accuracy for postoperative ONFH prediction. The probability of the diagnosis being postoperative ONFH generated by the model was evaluated using the receiver operating characteristic (ROC) curve and the area under the curve (AUC). The sensitivity, accuracy, recall and specificity of the radiographs for the prediction of ONFH were measured using a cutoff level probability of 0.5. A training curve was used to determine root mean squared error (RMSE) and loss, while a precision-recall curve was used to determine precision and recall.

Image Predictive Variable Evaluation
We compared the AI index with the predictive measurement scores assigned by the two orthopedic surgeons of different levels of experience with the results of the DL algorithm based on the same X-rays to evaluate the performance of the algorithm. Radiographs obtained 6-months after anteroposterior hip operations were randomly divided into two IPAC sequences by the study coordinator. A less experienced orthopedic doctor (Doctor A, 3rd year of residency in orthopedics) and an experienced orthopedic doctor (Doctor B, 18 years in orthopedics) participated in the reading session. Both doctors were not involved in surgery, data collection or reference labeling. A score based on the subjective prediction of the doctors using the postoperative X-ray to determine the most likely outcome at final follow-up was assigned using a 1-5 grading system. One indicated that the development of ONFH was considered to be impossible, while 5 indicated that the development of ONFH was considered to be certain. Each doctor independently graded the predictive variables for ONFH. Comparison between the performance of the AI index and the evaluation made by the two doctors was conducted through calibration and ROC analysis.

Development of Prediction Models
A multivariable logistic regression analysis was used to develop the clinical predication model based on patient and clinical variables. AI index classification was applied as a candidate predictor for univariate and multivariable logistic regression analyses for the construction of a DL-based postoperative ONFH prediction model using hybrid variables. A clinical prediction nomogram and a DL-based nomogram were then constructed based on multivariate logistic regression models. The work flowchart of this study is presented in Figure 1.

Assessment of Nomogram Performance
AI-based nomogram and clinical nomogram calibration were assessed using a calibration curve. The discrimination performance of both the AI-based nomogram and clinical nomogram were quantified using the AUC.

Clinical Use
Decision curve analysis (DCA) was performed by calculating the net benefits for a range of threshold probabilities to estimate the clinical utility of the nomogram.

Statistical Analysis
Median and mean standard deviation (SD) were used to describe continuous variables. Categorical variables were presented as frequencies and percentages. Statistical comparisons between groups were performed using the Mann-Whitney U-test and Chi-square test. R software version 3.0.1 was used to construct the nomogram. The "pROC" package was used to plot ROC curves. Nomogram construction and calibration plot creation were performed using the "rms" package. DCA was performed using the "dca.R" package. Model selection was based on the forward-backward step-wise method using the likelihood ratio test with Akaike's information criterion as the stopping rule. The model with the smallest Akaike Information Criterion was selected as the final model. The statistical significance levels reported are all two-sided, with statistical significance set at a P-value of 0.05.

Patient and Radiograph Characteristics
Postoperative radiographs of a total of 238 patients, including 95 ONFH patients and 143 normal patients were used for the development of the DL model and construction of the predictive nomogram. Imaging feature variables were extracted from each radiograph and were referred to as the AI index of all patients. Table 2 shows the baseline characteristics of the patients. Significant differences were found in BMI, Charlson comorbidity index, Injury Severity Score (ISS), d-dimer, timing of reduction, Garden classification and AI index between patients with ONFH and those without ONFH ( Table 2).

Performance of the CNN Model
A CNN model was established for the extraction of radiograph variables. The precision-recall curve of the test set is shown in Figure 2A, while the threshold value at the break-even point was 0.425. This point was set as the highest sum of sensitivity and specificity. Training accuracy values at this threshold for the training set was 0.903 and 0.873 for the test set. The change in RMSE and loss during the training process are shown in Figure 2B. Deviation of the RMSE in the training set and test set gradually decreased and the two curves leveled off (upper diagram) along with the increase of iterations. Similarly, as the number of iterations increased the deviation in loss between the training set and test set gradually decreased.

Performance of the Predictive Radiograph AI Variables
The calibration curve of the AI index for the prediction of postoperative ONFH demonstrated good agreement between prediction and actual observations, compared with that of Doctor A and Doctor B (Figure 3A). The sensitivity value was 0.910 (95% CI, 0.871-0.949) for the AI index, 0.657 (95% CI, 0.591-0.724) for the less experienced Doctor A and 0.827 (95% CI, 0.776-0.879) for experienced Doctor B (Figure 3B). The DCA curves shown in Figure 3C indicate that when the threshold probability for a doctor or a patient was within the range of 0.09-0.96, the AI index added more net benefits for the prediction, than that of Doctor A or Doctor B.   It showed that if the threshold probability is between 0.09 and 0.96, then using the AI index adds more benefit than testing either all or no patients.

Development of a Hybrid Prediction Model
In the univariate logistic regression analysis, BMI, Injury Severity Score (ISS), timing of reduction, Garden classification and AI index were found to be significant factors associated with ONFH in the training cohort (all P < 0.05;  Table 3). We then created a prediction nomogram that incorporated the above independent predictors and presented it as a hybrid nomogram (Figure 4A). A clinical nomogram was also constructed based on independent predictors excluded from the AI index ( Figure 4B).

Performance of the Hybrid Nomogram
The calibration curve of the hybrid nomogram for the prediction of postoperative ONFH demonstrated good agreement between prediction and actual observations, compared with that of the clinical nomogram ( Figure 5A). The AUC of the AIbased nomogram was 0.948 (95% CI, 0.920-0.976), while the AUC for the clinical nomogram was 0.696 (95% CI, 0.629-0.763) (Figure 5B). The difference was statistically significant, which indicated that the hybrid nomogram showed better discrimination and prediction ability for the diagnosis of ONFH.

Clinical Use
The DCA for the hybrid nomogram and for the clinical nomogram are presented in Figure 5C. The DCA indicated that when the threshold probability for a doctor or a patient was within the range of 0-0.98, the hybrid nomogram added more net benefits than "treat all" or "treat none" strategies. The range for the clinical nomogram was from 0.2 to 0.7, revealing that use of the hybrid nomogram to predict postoperative ONFH was more beneficial.

DISCUSSION
Early detection and identification of ONFH after femoral neck fracture fixation has been a long-term concern in clinical practice.
In this study, we developed and trained a DL model that could use postoperative pelvic radiographs to predict ONFH. The output values of the CNN model successfully stratified patients based on their risk of developing postoperative ONFH, which was referred to as AI index classification for prediction. The predictive  The threshold probability is where the expected benefit of treatment is equal to the expected benefit of avoiding treatment. It showed that if the threshold probability is between 0 and 0.98, then using the AI-based nomogram adds more benefit in predicting ONFH than testing either all or no patients.
performance of the AI index was significantly superior to the predictive performance of a less experienced orthopedic doctor and non-inferior to that of an experienced orthopedic doctor. A combination of patient and radiograph variables were used to construct an AI-based nomogram for postoperative ONFH prediction. The hybrid nomogram showed better performance for the postoperative prediction of ONFH than a single clinical nomogram, indicating its potential in predicting and targeting ONFH during clinical follow-up to provide a decision base for orthopedic doctors. Hip pain is the most common postoperative symptom after FNF surgery. It may be associated with fractures, surgery, implant irritation, and early ONFH that should be identified during follow-up. Postoperative X-rays are the most common and readily available imaging examination used for routine clinical follow-up after internal fixation. The detection of sclerotic abnormalities and trabecular interruptions of the femoral head for the diagnosis of postoperative ONFH are subjective and depend on the level of experience and diagnostic criteria used by each doctor. Only radiologists who are rich in experience, may be able to accurately predict ONFH using postoperative Xrays. Even then, objectivity and consistency may be difficult to be achieved. The increased workload of radiologists worldwide has already had a significant impact on the diagnostic performance of radiologists (29,30). Therefore, DL can be used as a potential auxiliary diagnostic tool for orthopedic diagnoses to obtain stable and accurate diagnoses (16,31). In this study, we trained a DL model to read postoperative X-rays to predict ONFH. The accuracy and consistency of the DL model was significantly better than that of an orthopedic doctor with less experience. The DL model was similar in accuracy but better in consistency, compared with the experienced orthopedic doctor. This indicated the potential of the use of the DL model for the diagnosis and prediction of postoperative ONFH. Previous studies have indicated that an important feature of the DL model is its ability to detect key features of images through cyclic learning undergone by neural networks, which may be different from the existing understanding and research on image features in black box models. This makes it possible for the diagnostic path of the DL model to differ from existing known diagnostic and prediction criteria, resulting in a positive difference in the diagnostic accuracy of the DL model, compared with that of orthopedic doctors. The DL model created in Chee's study showed a high level of sensitivity and accuracy for the diagnosis of pre-collapse ONFH (22). When we applied the CNN network obtained from this non-traumatic ONFH prediction model to our postoperative ONFH prediction, internal fixation of the postoperative X-ray was found to be one of the major differences between the two models. Recent studies have suggested that different fixation constructs, such as cannulated screws or dynamic hip screws, produce different fracture fixation outcomes. The location differences under the implemented operations standard for the same fixation construct do not significantly affect outcomes (32). During training, we found that the output of the DL model could still reflect prediction efficiency and showed good calibration, even though the positions of the metal internal fixations were not exactly the same and occupied the recognition area in the finite image pixel.
Existing studies using clinical risk factors, such as demographic data, fracture classification, and preoperative interval, to make preoperative predictions for surgical decisions (33)(34)(35). Due to the lack of the incorporation of all perioperative variables, especially the intraoperative and postoperative radiograph variables, the preoperative prediction models in these studies have shown difficulties in achieving an ideal predictive ability. For example, the clinical nomogram constructed in our study achieved an AUC of 0.696 (95% CI, 0.629-0.763), which is similar to the AUC of 0.746 obtain by the Naive Bayes Classifier constructed by Cui et al. (36). The predictive ability of a preoperative model is limited for patients who have received certain internal fixation, for example dynamic hip screws and cannulated compression screws (34,36). The hybrid nomogram showed better prediction performance after the incorporation of patient and radiograph variables, compared with conventional clinical nomograms and the simple radiographic-based DL model for postoperative ONFH prediction. In this study, the hybrid classifier achieved an AUC of 0.948 (95% CI, 0.920-0.976). The variables we included after multivariate regression analysis of all risk factors were similar to that of conventional preoperative clinical prediction models. High-risk factors generally include fracture patterns, preoperative interval, and BMI. Inclusion of the DL model-based imaging prediction significantly improved the ONFH predictive ability of the traditional prediction models, indicating the value of using a combination of variables. The predictive model using hybrid variables more closely mimicked the diagnostic and predictive processes of orthopedic doctors, who are better at interpreting images based on the clinical status of patients (37). The addition of a combination of patient and hospital process variables associated with routine clinical care improved the ability of a DL model trained by Badgeley et al. to predict hip fractures (38). One explanation for this improvement was the presence of non-biological signals on radiographs that are predictive of diseases (39). Although multiple regression analyses were performed for risk factors, including intraoperative reduction, and postoperative weight bearing, the variables included in the single clinical nomogram were all preoperative variables. Among them, Garden classification showed the most assigned value, which was similar to the results of previous studies that found that fracture patterns are crucial for the prediction of postoperative ONFH (7,40). When the postoperative AI index was included, the attribution of Garden classification decreased significantly, which may be because the AI index already included certain manually incorporated graded variables from the images. The information was considered as a non-biological signal and contributed to the classification. The DL-based prediction model that incorporated a combination of patient and radiograph variables showed a significantly higher ability of prediction postoperative ONFH, and can be used to provide second opinions and a base for doctors to make decisions during clinical follow-up.
In the DCA curves analysis, prediction and diagnosis based on the DL model were found to be non-inferior to that of the two orthopedic doctors, while that of the AI-based nomogram using hybrid variables was superior to imaging prediction alone, allowing for more accurate diagnosis and prediction during clinical follow-up. There is no doubt that the gold standard imaging modality for the preliminary stages of ONFH is MRI (41,42). However, MRI is not the most common test used to evaluate treatment options and ONFH during postoperative FNF follow-up. MRIs are affected by metal implants, which may cause potential internal fixation losses and thermal effect (43). MRI tests are more expensive, take longer, and require the radiologist to have a higher level of diagnostic experience. Nomograms based on the DL model and clinical variables can improve the ability of positive diagnostic screening and provide doctors the opportunity of obtaining a second opinion.
The AI-based nomogram using hybrid variables may potentially assist in decision making during clinical followup as patients with early-stage ONFH may benefit from timely interventions (44). Although the definitive method of treatment for traumatic ONFH remains controversial, certain early interventions have been widely used during post-operative clinical follow-up. For patients with a high probability of developing ONFH, interventions for hip preservation or delayed joint replacement, including platelet-rich plasma (PRP)incorporated autologous granular and free vascularized fibular, have been proven to be safe and effective procedures for postoperative ONFH (45,46). Extracorporeal shock wave therapy and alendronate administration can also be potentially performed on patients with a moderate probability of a risk of developing ONFH (47)(48)(49). We assessed whether the AIbased nomogram assisted decisions that would improve patient outcomes to justify its clinical usefulness. Our study showed that if the threshold probability was between 0.06 and 0.96, as shown by the constructed decision curves, the AI-based nomogram could predict postoperative ONFH compared with treating either all or no patients. This indicated that early postoperative prediction using this hybrid of patient and radiograph variables can be useful for the application of early interventions that may even allow for a reasonable delay of the onset of arthroplasty (50). Substantial positive rehabilitation can be applied after accurate predictions are obtained after the operation for patients with a lower prediction probability, which will also relieve patient anxiety (51).
This study has some limitations. First, it was conducted on a retrospective cohort study, and is therefore likely to have been affected by selection bias. Second, due to the rarity of the disease, our study included only 238 images in the CNN model. The performance of the CNN model can be improved by using a larger multicenter sample size. Third, our diagnostic criteria for postoperative ONFH was based on follow-up MRIs and typical pelvic radiographs without the use of histopathological confirmation. Therefore, false-negative and false-positive values would not have been avoided due to the subjectivity of the imaging diagnosis method. At the same time, transverse comparison was not conducted with gold standard MRI when postoperative X-rays were included 6 months after surgery. The reason was that, as a retrospective study, MRIs had been performed on only 197 patients, probably due to their high cost. In the future, prospective clinical studies using larger cohorts should be preplanned to investigate strategies that can be used for ONFH prediction of patients after internal fixation.

CONCLUSION
In conclusion, this study presents a DL facilitated nomogram that incorporates hybrid radiograph and patient variables, shows favorable predictive accuracy for preoperative osteonecrosis of femoral head in patients with femoral neck fractures after internal fixation.

DATA AVAILABILITY STATEMENT
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

ETHICS STATEMENT
The studies involving human participants were reviewed and approved by the First Affiliated Hospital of USTC. The patients/participants provided their written informed consent to participate in this study.