Machine learning models predict coagulopathy in traumatic brain injury patients in ER

Wang, Haoyu; Cao, Wenying; Huang, Jianhuang; Feng, Yuxing; Li, Cheng

doi:10.3389/fneur.2025.1649869

ORIGINAL RESEARCH article

Front. Neurol., 18 September 2025

Sec. Neurotrauma

Volume 16 - 2025 | https://doi.org/10.3389/fneur.2025.1649869

Machine learning models predict coagulopathy in traumatic brain injury patients in ER

Haoyu Wang¹^†

Wenying Cao²^†

Jianhuang Huang³^*

Yuxing Feng²^*

Cheng Li²^*

¹Department of Neurosurgery, Chongqing Ninth People's Hospital, Chongqing, China
²Department of Neurology, The Ninth People’s Hospital of Chongqing, Chongqing, China
³Department of Neurosurgery, Affiliated Hospital of Putian University, Fujian, China

Traumatic brain injury (TBI) is a critical emergency condition, with 15–35% of patients developing coagulopathy, increasing risks of secondary brain injury and mortality. We developed a machine learning model to predict coagulopathy in TBI patients in the emergency room. Using data from 322 TBI patients (mean age 55.7 ± 21.1 years, coagulopathy incidence 15.8%) at Chongqing Ninth People’s Hospital (2018–2024), we collected clinical and laboratory data (GCS scores, blood counts, liver function). Data were preprocessed in R, using SMOTE for class imbalance and selecting top 70% features by information gain. Among 11 algorithms, Random Forest (RF) achieved the best performance (AUC = 0.92, recall = 0.94, false negative rate = 6%), outperforming coagulation tests. Neutrophil percentage, A/G ratio, and ALT were key predictors, reflecting inflammation and liver dysfunction. SHAP analysis enhanced model interpretability. This model supports rapid risk stratification for early intervention, though multi-center validation is needed.

1 Introduction

Traumatic brain injury (TBI) is one of the most common critical conditions in emergency departments. Globally, an estimated 50 million TBI cases occur annually, significantly increasing mortality and disability risks and presenting a major public health challenge (1–3). Research indicates that approximately 35% of TBI patients exhibit coagulopathy upon admission (4–6), which can lead to secondary brain injuries (such as hematoma expansion and new hemorrhages), substantially worsening patient outcomes (5, 7, 8). Therefore, timely identification and management of coagulopathy is not only a core component of emergency TBI treatment but directly impacts patient prognosis (9–11).

Currently, coagulation function assessment relies on traditional testing methods such as prothrombin time (PT) and activated partial thromboplastin time (APTT). However, these methods are time-consuming (typically requiring tens of minutes to hours) and have limited sensitivity for trauma-induced coagulopathy (12–15). Recent studies have revealed that TBI-related coagulopathy is closely associated with multiple factors including inflammatory response, vascular endothelial damage, and impaired hepatic synthesis of coagulation factors (5, 16, 17). These complex mechanisms necessitate rapid, comprehensive predictive tools to meet the time-sensitive demands of emergency care.

Machine learning (ML) can integrate high-dimensional data to discover predictive patterns that traditional statistical methods struggle to capture, demonstrating superior potential to conventional models in trauma medicine. While recent ML applications in emergency risk stratification have significantly improved prediction accuracy, research on rapid prediction of TBI-associated coagulopathy remains limited, with existing models often lacking interpretability or real-time applicability. This study aims to develop and validate a machine learning-based prediction model using routine clinical and laboratory data available in emergency departments to rapidly identify high-risk TBI patients for coagulopathy, supporting early intervention and facilitating precise emergency management.

2 Methods

2.1 Data source

This study utilized the TBI database from Chongqing Ninth People’s Hospital, collecting clinical and laboratory data from emergency department TBI patients between January 2018 and December 2024. The target was to predict whether patients would develop coagulopathy (binary variable). Coagulopathy was defined as elevated International Normalized Ratio (INR ≥ 1.2) or prolonged activated partial thromboplastin time (APTT, reference range 28–34 s) (14, 18, 19). Clinical features collected included demographic information (e.g., age, gender), comorbidities (e.g., hypertension, diabetes), trauma-related indicators (e.g., GCS score, TBI type, injury location), and laboratory results (e.g., complete blood count, liver and kidney function tests, electrolytes). Inclusion criteria were: age≥18 years, CT-confirmed TBI, and complete laboratory data within 2 h of admission. Exclusion criteria included: severe non-cranial polytrauma, pre-existing coagulation disorders (e.g., hemophilia), use of anticoagulant medications (to avoid confounding factors), or missing critical data. This study was approved by the Ethics Committee of Chongqing Ninth People’s Hospital (Ethics approval number: 2025011). Due to the retrospective design of this study, the Ethics Committee waived the requirements for informed consent and clinical trial registration. All data were collected and analyzed anonymously, with no potential harm to patients. This research was conducted in accordance with the Declaration of Helsinki (2013 revision). Figure 1 illustrates the technical roadmap of this study.

Figure 1

Flowchart detailing the processing of a traumatic brain injury database with 658 cases. 405 cases met the inclusion criteria, and 322 passed the exclusion criteria. 83 cases had missing key data. Steps include data preprocessing with cleaning and feature engineering, model selection with evaluation and tuning, model training with performance evaluation and error analysis, and model interpretation with global and local analysis.

Figure 1. Technical roadmap of this study.

2.2 Data preprocessing

Data preprocessing was completed in R 4.3.3 environment. The dataset was split in a 7:3 ratio (stratified sampling to ensure balanced coagulopathy proportions) into training and testing sets. The R package “dplyr” (V1.1.4) (20) was used to handle missing values, with numerical variables imputed using median values and categorical variables imputed using mode values. Categorical variables were converted to numerical form through one-hot encoding. Numerical features were centered (mean = 0) and standardized (standard deviation = 1) using “mlr3pipelines” (V0.7.2) (21). To address data imbalance, the R package “smotefamily” (V1.4.0) (22) applied the “SMOTE” technique (K = 5, replication factor = 5) to generate synthetic samples, and 70% of the most predictive features were selected based on information gain, removing zero-variance features. The final dataset was defined as a binary classification task (ID “Coagulation”) through the R package “mlr3” (V0.23.0) (23) framework, using 5-fold cross-validation to divide training and validation data.

2.3 Model selection and rapid screening

To select appropriate machine learning algorithms, the mlr3 framework was used to preliminarily evaluate the default parameter performance of 11 models, including Random Forest (RF), Gradient Boosting Machine (GBM), XGBoost, Elastic Net Regularized Logistic Regression, Naive Bayes, Decision Tree, Logistic Regression, k-Nearest Neighbors (k-NN), Support Vector Machine (SVM), Linear Discriminant Analysis (LDA), and Single-Layer Neural Network. The evaluation process employed 5-fold cross-validation, with Area Under the Curve (AUC) as the primary metric, supplemented by accuracy, precision, recall, and F1 score. The best-performing Random Forest, Gradient Boosting Machine, and Support Vector Machine were selected for hyperparameter tuning.

2.4 Hyperparameter tuning

Random Forest optimization parameters included number of trees (100–500), number of features for splitting (2–12), minimum node size (1–8), and maximum number of nodes (3–10), using random search (50 iterations) via the R package “mlr3tuning” (V1.3.0) (24). Gradient Boosting Machine optimized number of trees (50–1,000), interaction depth (3–10), and learning rate (0.01–0.1) using the Hyperband algorithm (25). Support Vector Machine optimized kernel type (linear/polynomial/radial), regularization parameter (0.1–10), kernel parameter (0.01–1), and polynomial kernel degree (2–5) using random search. Tuning targeted AUC, incorporating 5-fold cross-validation and comprehensively evaluating AUC, accuracy, precision, recall, and F1 score. The best-performing model was selected for further analysis.

2.5 Final model training and evaluation

Random Forest, showing the best performance, was selected as the final model, with optimal hyperparameters (number of trees = 225, features for splitting = 8, minimum node size = 2, maximum nodes = 5) used for retraining on the complete dataset, configuring probability output to support subsequent analysis, implemented through the mlr3 framework. Model performance was evaluated through 5-fold cross-validation, reporting mean and median values of AUC, accuracy, precision, recall, and F1 score, along with confusion matrix analysis of classification error distribution. Out-of-bag error of Random Forest was used to verify the adequacy of decision tree quantity. The model output coagulopathy probability, optimized based on the ROC curve, with a probability threshold of 0.6 (corresponding to SHAP value>0.7) selected to balance sensitivity and specificity.

2.6 Model interpretation

To enhance model interpretability, global and local analysis methods were employed. At the global level, the DALEX package calculated feature contributions to predictions through Dropout Loss, with results visualized as bar charts using the R package “ggplot2” (V3.5.1) (26). At the local level, the R package “shapviz” (V0.9.7) (27) calculated SHAP values for 150 random samples, analyzing feature contributions to individual predictions, visualized with bee swarm plots and force plots, with the background dataset randomly sampled from 100 preprocessed data points.

2.7 Statistical analysis

Statistical description and analysis were completed in R 4.3.3 environment. Continuous variables were described as mean±standard deviation, categorical variables as frequency (percentage), group comparisons used Wilcoxon rank-sum test or chi-square test, with significance level set at p < 0.05.

3 Results

3.1 Clinical characteristics of the cohort

Between 2018 and 2024, emergency department admitted 658 acute TBI patients, of whom 405 met preliminary inclusion criteria. After excluding those with missing key data or other disqualifying conditions (n = 83), 322 patients (mean age 55.7 ± 21.1 years, 61% male) were ultimately included for analysis. Fifty-one patients (15.8%) were diagnosed with coagulopathy. Table 1 summarizes baseline characteristics. Significant clinical differences were observed between coagulopathy and non-coagulopathy groups: (1) Trauma severity: coagulopathy group had significantly lower GCS scores than non-coagulopathy group (7.7 ± 2.7 vs. 9.4 ± 2.5, p < 0.001). (2) Inflammatory response: coagulopathy group showed significantly elevated neutrophil percentage (85.0% ± 11.0% vs. 70.0% ± 11.0%, p < 0.001). (3) Liver function indicators: coagulopathy group had higher ALT levels (28.1 ± 7.1 vs. 22.3 ± 5.1 U/L, p < 0.001) and lower A/G ratio (1.31 ± 0.32 vs. 1.72 ± 0.24, p < 0.001). (4) Hematological indicators: coagulopathy group had lower hemoglobin (Hb) levels (104.3 ± 11.7 vs. 117.8 ± 9.8 g/L, p < 0.001), and significantly elevated white blood cell count (WBC) and neutrophil count (NEUT) (p < 0.001). Other parameters such as serum sodium, potassium, calcium, and gender showed no significant differences (p < 0.05, Supplementary Table 1).

Table 1

Table 1. Characteristic of TBI patients with coagulopathy and without coagulopathy.

3.2 Model selection and initial performance screening

Using the mlr3 framework, we preliminarily evaluated the default parameter performance of 11 machine learning algorithms (Table 2). Although recall might be the primary consideration for model performance in emergency settings, all models maintained high recall levels (0.93–0.97), so an AUC-based optimization strategy better aligned with clinical risk stratification needs. Results showed that GBM algorithm (AUC = 0.89), RF algorithm (AUC = 0.88), and SVM algorithm (AUC = 0.87) performed best and were selected for subsequent hyperparameter tuning.

Table 2

Table 2. Performance of 11 algorithm models with default parameters.

3.3 Optimal model establishment and evaluation

After hyperparameter tuning, we obtained performance parameters for these three algorithms (Table 3). The RF algorithm performed best (Figure 2), with test set AUC improving from 0.87 to 0.92 (ΔAUC = +0.04), and median cross-validation AUC of 0.91 (IQR 0.89–0.93). In comparison, GBM and SVM algorithms improved to AUCs of 0.90 and 0.88 respectively, while RF algorithm training time was only 15% of GBM’s (2.3 min vs. 15 min), making it more suitable for real-time prediction needs in emergency settings. Additionally, AUC and PRC curves showed RF model superiority over GBM and SVM (Figures 3a,b). Classification error analysis indicated (Figure 3c) that RF had a negative prediction error rate as low as 6%, outperforming GBM (9%) and SVM (17%). Therefore, Random Forest algorithm was the optimal model. RF optimal parameters were: number of decision trees = 225, features for splitting = 8, minimum node size = 2. Model robustness analysis showed that when tree number>220, OOB error entered a stable plateau (<0.08), indicating 225 trees sufficiently balanced model complexity and prediction accuracy. Error rates were 0.13 (95%CI 0.10–0.16) for the coagulopathy group and 0.05 (95%CI 0.03–0.07) for the non-coagulopathy group, with McNemar test showing no significant difference between group errors (p = 0.12, Figure 3d).

Table 3

Table 3. Performance indicators of optimized random forest, GBM and SVM.

Figure 2

Box plots showing the distribution of four different metrics (AUC, Recall, F-beta, ACC) across six algorithms (Random Forest, XGBoost, GLMnet, Bayes Algorithm, GBM, Decision Tree) related to coagulation. Each plot (a, b, c, d) illustrates variations in performance metrics among the algorithms.

Figure 2. Performance indicators of GBM, SVM and RF algorithms after parameter tuning, with horizontal lines representing median performance indicators under full parameters. (a) AUC distribution of various algorithms. (b) Recall distribution of various algorithms. (c) F-beta distribution of various algorithms. (d) ACC distribution of various algorithms.

Figure 3

Four panels comparing machine learning models. Panel (a) shows ROC curves for GBM, Random Forest, and SVM models with True Positive Rate versus False Positive Rate. Panel (b) displays Precision-Recall curves for these models. Panel (c) illustrates a bar chart of classification error rates, indicating SVM has the highest error. Panel (d) presents a line graph of error rates over decision trees, comparing models with and without coagulopathy against the OOB metric.

Figure 3. Random Forest model is the optimal model. (a,b) AUC and PRC show RF algorithm outperforms GBM and SVM. (c) Classification error chart shows RF algorithm has minimal classification error. (d) Out-of-bag error chart suggests current RF algorithm parameters are well optimized.

Random Forest model feature importance analysis revealed neutrophil percentage, A/G ratio, ALT, hemoglobin (Hb), neutrophil count, white blood cell count, GCS score, and lymphocyte percentage as key predictors (Figure 4a). SHAP analysis further revealed: (1) Positive drivers: neutrophil percentage>90% contributed the highest SHAP value, suggesting intense inflammatory response as a core inducer of coagulopathy; A/G < 1.15 and ALT>33.4 U/L indicated liver dysfunction, associated with decreased coagulation factors. Hb < 109.5 g/L and white blood cell count>20.2 × 10⁹/L correlated with increased coagulopathy risk, reflecting contributions of trauma-related blood loss and inflammatory response. (2) Protective factors: GCS score>8 (non-linear relationship) and lymphocyte percentage>15% (anti-inflammatory state) correlated with reduced coagulopathy risk (Figures 4b,c).

Figure 4

Chart panel illustrating feature importance in a model. Panel (a) displays a bar graph ranking features by importance, with NEUT%, A/G, and ALT as top contributors. Panel (b) depicts a SHAP summary plot with color-coded features indicating positive or negative impact on predictions. Panel (c) shows a flow chart delineating feature contributions to a final prediction, highlighting NEUT%, A/G among others.

Figure 4. Feature importance of Random Forest model and impact on prediction. (a) Bar chart shows feature importance ranking based on optimal parameters of Random Forest model. (b,c) Bee swarm plot and force plot show feature impact on predictions.

4 Discussion

This study developed and validated a Random Forest-based machine learning model for predicting coagulopathy risk in emergency department TBI patients. RF demonstrated excellent discriminative ability (AUC = 0.92, median cross-validation AUC = 0.91) and recall (Recall = 0.95, median cross-validation Recall = 0.93), outperforming other candidate algorithms (such as Gradient Boosting Machine and Support Vector Machine). With a training time of only 2.3 min and false negative rate as low as 6%, the model significantly outperforms traditional coagulation tests (such as INR and APTT), which are limited by long processing times and inconsistent diagnostic thresholds (12–14). By integrating routine clinical and laboratory data from emergency departments, this model can rapidly identify high-risk populations for coagulopathy, supporting early intervention and facilitating precise emergency management. These results demonstrate the potential application of machine learning technology in emergency medicine, particularly in time-sensitive TBI treatment scenarios.

4.1 Model performance and clinical significance

The excellent performance of the RF model benefits from its powerful capability to process high-dimensional, heterogeneous data (28). The study found that the model’s negative prediction error rate was as low as 6%, which is particularly important in emergency settings, as missed diagnosis of coagulopathy can lead to serious consequences such as hematoma expansion (5, 7, 8). Compared to existing literature, this study’s AUC (0.92) is higher than previous TBI coagulopathy prediction models based on logistic regression (AUC = 0.80–0.85), demonstrating the advantages of ensemble learning methods (6, 17). Additionally, the RF model’s training time was only 15% of the Gradient Boosting Machine’s, which is crucial for emergency prediction systems requiring rapid deployment (29). The model’s robustness was validated through out-of-bag error (OOB error<0.11) and McNemar test (p = 0.12), indicating consistent performance across different data subsets and suitability for dynamic emergency environments (30). Furthermore, traditional coagulation tests (CCAs) face issues of inconsistent diagnostic standards and variable positivity rates in TBI patients, whereas this study’s model achieved an AUC of 0.92, significantly outperforming single indicators of CCAs (such as INR or APTT), and integrated dynamic indicators including inflammation and liver function, better aligning with the multi-factorial pathological mechanisms of TBI coagulopathy (12–15). The importance of early identification of coagulopathy in reducing rebleeding risk is well-established; this model’s rapid prediction capability (training time of only 2.3 min) can support emergency departments in initiating targeted treatments (such as tranexamic acid or PCC infusion) within the golden hour, consistent with guideline-recommended “damage control” strategies (31, 32).

Notably, this study prioritized optimizing AUC rather than recall, although the latter is typically more critical in emergency settings (33, 34). This approach was justified as all candidate models maintained high recall levels (0.93–0.96), making AUC optimization more suitable for achieving risk stratification, thereby helping clinicians precisely identify high-risk patients and rationally allocate medical resources. Future research could further explore model performance under different recall thresholds to meet specific clinical needs (such as maximizing sensitivity to reduce missed diagnosis rates).

Additionally, the coagulopathy incidence reported in this study (15.8%) is lower than the 35% mentioned in the introduction (4–6), possibly due to the exclusion of anticoagulant users and adoption of stricter coagulopathy definitions (INR ≥ 1.2 or APTT>34 s). Furthermore, coagulopathy risk increases significantly with trauma severity, and the relatively low proportion of severe TBI in this study sample (approximately 20% of patients with GCS ≤ 8) may further explain the lower overall incidence.

4.2 Key predictors and their mechanisms

Feature importance analysis indicates that the percentage of neutrophils, A/G ratio, ALT, hemoglobin (Hb), neutrophil count, white blood cell count, GCS score, and lymphocyte percentage are core factors in predicting TBI-related coagulopathy. Neutrophil percentage was the primary predictor, confirming that the inflammatory cascade reaction triggered by TBI is a core driver of coagulopathy (5, 35, 36). Post-TBI inflammatory response leads to coagulation factor consumption through activation of tissue factor (TF) and endogenous anticoagulation pathways (such as protein C pathway), consistent with the high contribution of neutrophil percentage in this study (37, 38). Additionally, neutrophil extracellular traps (NETs) released by neutrophils may exacerbate hyperfibrinolysis, warranting further investigation (5). Low A/G ratio and high ALT suggest liver dysfunction, which may directly relate to decreased ability to synthesize coagulation factors after TBI (39, 40). This might be associated with the liver’s role in synthesizing anticoagulant factors (such as antithrombin III) (41, 42), consistent with Tsai et al.’s findings that the De Ritis ratio (AST/ALT) correlates with trauma severity, indicating that liver function metrics may serve as biomarkers for coagulopathy risk in TBI patients (43). Low Hb levels, a significant predictive factor, may reflect trauma-associated occult bleeding or inflammation-induced suppression of erythropoiesis (13). Prisco et al. also identified low Hb as a vital predictor of mortality in severe TBI patients, highlighting the strong link between trauma severity and coagulopathy (44). Additionally, patients in the coagulopathy group were older, had lower Hb levels, and poorer GCS scores. These findings align with Depreitere et al.’s research on the vulnerability of elderly trauma patients, suggesting that reduced physiological reserve in older TBI patients may lead to more severe clinical outcomes (45). However, the incidence of coagulopathy among older patients in this study was not significantly increased, likely due to the low proportion of severe TBI cases (approximately 20% with GCS ≤ 8) and the stringent definition of coagulopathy (INR ≥ 1.2 or APTT > 34 s). Furthermore, GCS score>8 and high lymphocyte percentage as protective factors, associated with milder trauma and anti-inflammatory states, may reduce coagulopathy risk by alleviating systemic inflammatory burden (46–48). Severe TBI patients with GCS ≤ 8 have a coagulopathy incidence as high as 60% (49), consistent with this study’s findings (risk sharply increases when GCS ≤ 8). The elevation in white blood cell count further supports the critical role of inflammation in the occurrence of coagulopathy. These indicators are readily obtainable in routine emergency assessments, enhancing the model’s clinical applicability. The predictive factors identified in this study are consistent with existing literature, providing new insights into the pathophysiological mechanisms underlying TBI-related coagulopathy.

4.3 Clinical application scenarios

This model can be integrated into emergency decision support systems, supporting the following scenarios: (1) High-risk patient stratification: SHAP value>0.7 corresponds to model prediction probability>0.6; based on cross-validation recall>0.93 and false negative rate<6%, it is suitable for identifying high-risk patients and prioritizing thromboelastography testing to guide individualized blood transfusion (such as fibrinogen concentrate). (2) Dynamic monitoring: inputting updated laboratory data every 6 h within 24 h after admission to dynamically update prediction probabilities and capture delayed coagulopathy. (3) Resource optimization: prioritizing intensive care resources for high-risk patients and optimizing emergency processes. Combined with “damage control” strategies (such as tranexamic acid), effects can be further enhanced (50).

4.4 Strengths and limitations

The main strengths of this study include: (1) Utilization of real-world data, enhancing the model’s generalizability; (2) Application of SMOTE technique to address data imbalance, ensuring the model’s predictive ability for the minority class (coagulopathy group) (51, 52); (3) Provision of global and local explanations through SHAP and DALEX analyses, enhancing model interpretability and helping clinicians understand prediction results (53). However, the study also has limitations, including: (1) The data is sourced from a single institution with a limited sample size (n = 322), and there is a significant imbalance between the bleeding disorder group (n = 51) and the non-bleeding disorder group (n = 271), reflecting the actual incidence of TBI-related bleeding disorders (15.8%). Although we addressed the data imbalance using SMOTE to enhance the model’s predictive capability for the minority class (bleeding disorder group), this imbalance may still impact the model’s generalizability, especially in different populations or healthcare settings. Therefore, future multicenter studies should include larger sample sizes and more balanced groups to further validate the model’s robustness and applicability.(2) Exclusion of antiplatelet medications due to missing data, potentially underestimating bleeding risk in some patients. (3) Real-time deployment of the model still requires validation, with future needs for developing user-friendly interfaces and conducting prospective testing to evaluate its performance in actual emergency environments.

4.5 Future directions

The RF model in this study provides an effective tool for rapid prediction of TBI coagulopathy, but there remains room for improvement. First, multi-center studies could validate the model’s generalizability and include broader patient populations (such as different races, age groups, or patients with comorbidities). Second, exploration of deep learning methods (such as convolutional neural networks) or time series analysis could capture dynamic changes in coagulation status after TBI. Additionally, combining bedside coagulation testing technologies (such as thromboelastography) might further enhance the model’s real-time capability and accuracy. Finally, developing a clinical decision support system (CDSS) based on this model and evaluating its actual impact on patient outcomes through randomized controlled trials would be an important direction for future research.

5 Conclusion

This study demonstrates that a Random Forest-based machine learning model can efficiently predict coagulopathy risk in emergency department TBI patients, with high discriminative ability and clinical utility. Neutrophil percentage, A/G ratio, and ALT play key roles in prediction, suggesting inflammation and liver dysfunction as important drivers of coagulopathy. Despite certain limitations, this model provides a new tool for precise risk stratification in emergency medicine, laying the foundation for improving TBI patient outcomes.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary material, further inquiries can be directed to the corresponding authors.

Ethics statement

The studies involving humans were approved by Ethics Committee of Chongqing Ninth People’s Hospital. The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.

Author contributions

HW: Visualization, Data curation, Formal analysis, Methodology, Investigation, Conceptualization, Writing – original draft. WC: Visualization, Data curation, Methodology, Writing – original draft, Conceptualization, Investigation. JH: Conceptualization, Project administration, Writing – review & editing, Supervision, Funding acquisition. YF: Writing – review & editing, Formal analysis, Validation, Methodology. CL: Formal analysis, Validation, Resources, Writing – review & editing.

Funding

The author(s) declare that financial support was received for the research and/or publication of this article. This study was supported by the Science and Technology Project of Putian University (Grant No. 2015048). The funding, provided by the institution of the JH was used to support data collection, analysis, and manuscript preparation.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The authors declare that no Gen AI was used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fneur.2025.1649869/full#supplementary-material

References

1. Khellaf, A, Khan, DZ, and Helmy, A. Recent advances in traumatic brain injury. J Neurol. (2019) 266:2878–89. doi: 10.1007/s00415-019-09541-4

PubMed Abstract | Crossref Full Text | Google Scholar

2. Vrettou, CS, and Mentzelopoulos, SD. Second- and third-tier therapies for severe traumatic brain injury. J Clin Med. (2022) 11:790. doi: 10.3390/jcm11164790

PubMed Abstract | Crossref Full Text | Google Scholar

3. Lavinio, A, Coles, JP, Robba, C, Aries, M, Bouzat, P, Chean, D, et al. Targeted temperature control following traumatic brain injury: ESICM/NACCS best practice consensus recommendations. Crit Care. (2024) 28:170. doi: 10.1186/s13054-024-04951-x

PubMed Abstract | Crossref Full Text | Google Scholar

4. Maas, AIR, Menon, DK, Adelson, PD, Andelic, N, Bell, MJ, Belli, A, et al. Traumatic brain injury: integrated approaches to improve prevention, clinical care, and research. Lancet Neurol. (2017) 16:987–1048. doi: 10.1016/S1474-4422(17)30371-X

PubMed Abstract | Crossref Full Text | Google Scholar

5. Jin, J, Wang, F, Tian, J, Zhao, X, Dong, J, Wang, N, et al. Neutrophil extracellular traps contribute to coagulopathy after traumatic brain injury. JCI Insight. (2023) 8:110. doi: 10.1172/jci.insight.141110

PubMed Abstract | Crossref Full Text | Google Scholar

6. Maegele, M, Schöchl, H, Menovsky, T, Maréchal, H, Marklund, N, Buki, A, et al. Coagulopathy and haemorrhagic progression in traumatic brain injury: advances in mechanisms, diagnosis, and management. Lancet Neurol. (2017) 16:630–47. doi: 10.1016/S1474-4422(17)30197-7

PubMed Abstract | Crossref Full Text | Google Scholar

7. Böhm, JK, Schaeben, V, Schäfer, N, Güting, H, Lefering, R, Thorn, S, et al. Extended coagulation profiling in isolated traumatic brain injury: a CENTER-TBI analysis. Neurocrit Care. (2022) 36:927–41. doi: 10.1007/s12028-021-01400-3

PubMed Abstract | Crossref Full Text | Google Scholar

8. Morris, MC, Singer, KE, Niziolek, GM, McGlone, E, Veile, R, Friend, LA, et al. Oxygenation extremes after traumatic brain injury transiently affect coagulation. Thromb Res. (2020) 186:58–63. doi: 10.1016/j.thromres.2019.12.013

PubMed Abstract | Crossref Full Text | Google Scholar

9. Hossain, I, Rostami, E, and Marklund, N. The management of severe traumatic brain injury in the initial postinjury hours - current evidence and controversies. Curr Opin Crit Care. (2023) 29:650–8. doi: 10.1097/MCC.0000000000001094

PubMed Abstract | Crossref Full Text | Google Scholar

10. Nakae, R, Murai, Y, Takayama, Y, Namatame, K, Matsumoto, Y, Kanaya, T, et al. Neurointensive Care of Traumatic Brain Injury Patients Based on coagulation and fibrinolytic parameter monitoring. Neurol Med Chir. (2022) 62:535–41. doi: 10.2176/jns-nmc.2022-0226

PubMed Abstract | Crossref Full Text | Google Scholar

11. Cone, DC, Spaite, DW, and Coats, TJ. Out-of-hospital Tranexamic acid for traumatic brain injury. JAMA. (2020) 324:946–7. doi: 10.1001/jama.2020.9244

PubMed Abstract | Crossref Full Text | Google Scholar

12. Laroche, M, Kutcher, ME, Huang, MC, Cohen, MJ, and Manley, GT. Coagulopathy after traumatic brain injury. Neurosurgery. (2012) 70:1334–45. doi: 10.1227/NEU.0b013e31824d179b

PubMed Abstract | Crossref Full Text | Google Scholar

13. Maegele, M. Coagulopathy after traumatic brain injury: incidence, pathogenesis, and treatment options. Transfusion. (2013) 53:28s–37s. doi: 10.1111/trf.12033

Crossref Full Text | Google Scholar

14. Zhu, F, Pan, Z, Tang, Y, Fu, P, Cheng, S, Hou, W, et al. Machine learning models predict coagulopathy in spontaneous intracerebral hemorrhage patients in ER. CNS Neurosci Ther. (2021) 27:92–100. doi: 10.1111/cns.13509

PubMed Abstract | Crossref Full Text | Google Scholar

15. Dong, JF, Zhang, F, and Zhang, J. Detecting traumatic brain injury-induced coagulopathy: what we are testing and what we are not. J Trauma Acute Care Surg. (2023) 94:S50–s55. doi: 10.1097/TA.0000000000003748

PubMed Abstract | Crossref Full Text | Google Scholar

16. Maegele, M, Aversa, J, Marsee, MK, McCauley, R, Chitta, SH, Vyakaranam, S, et al. Changes in coagulation following brain injury. Semin Thromb Hemost. (2020) 46:155–66. doi: 10.1055/s-0040-1702178

PubMed Abstract | Crossref Full Text | Google Scholar

17. Talving, P, Benfield, R, Hadjizacharia, P, Inaba, K, Chan, LS, and Demetriades, D. Coagulopathy in severe traumatic brain injury: a prospective study. J Trauma. (2009) 66:55–62. doi: 10.1097/TA.0b013e318190c3c0

PubMed Abstract | Crossref Full Text | Google Scholar

18. Zierk, J, Ganslandt, T, Rauh, M, Metzler, M, and Strasser, E. Data mining of reference intervals for coagulation screening tests in adult patients. Clin Chim Acta. (2019) 499:108–14. doi: 10.1016/j.cca.2019.09.006

PubMed Abstract | Crossref Full Text | Google Scholar

19. Hunt, H, Stanworth, S, Curry, N, Woolley, T, Cooper, C, Ukoumunne, O, et al. Thromboelastography (TEG) and rotational thromboelastometry (ROTEM) for trauma induced coagulopathy in adult trauma patients with bleeding. Cochrane Database Syst Rev. (2015) 2015:Cd010438. doi: 10.1002/14651858.CD010438.pub2

PubMed Abstract | Crossref Full Text | Google Scholar

20. Wickham, H, François, R, Henry, L, Müller, K, and Vaughan, D. Dplyr: a Grammar of Data Manipulation. R package version 1.1.4, (2025). Available at: doi: https://dplyr.tidyverse.org

Google Scholar

21. Binder, M, Pfisterer, F, Lang, M, Schneider, L, Kotthoff, L, and Bischl, B. mlr3pipelines-flexible machine learning pipelines in r. J Mach Learn Res. (2021) 22:1–7. Available at: https://jmlr.csail.mit.edu/beta/papers/v22/21-0281.html.

Google Scholar

22. Siriseriwan, W. Smotefamily: a collection of oversampling techniques for class imbalance problem based on SMOTE. R Package. (2019) 1:15. Available at: https://search.r-project.org/CRAN/refmans/smotefamily/html/00Index.html.

Google Scholar

23. Lang, M, Binder, M, Richter, J, Schratz, P, Pfisterer, F, Coors, S, et al. mlr3: a modern object-oriented machine learning framework in R. J Open Source Softw. (2019) 4:1903. doi: 10.21105/joss.01903

Crossref Full Text | Google Scholar

24. Becker, M, Lang, M, Richter, J, Bischl, B, and Schalk, D. mlr3tuning: Hyperparameter Optimization for ‘mlr3’. R package version 1.4.0, (2025). Available at: https://mlr3tuning.mlr-org.com.

Google Scholar

25. Li, L, Jamieson, K, DeSalvo, G, Rostamizadeh, A, and Talwalkar, A. Hyperband: a novel bandit-based approach to hyperparameter optimization. J Mach Learn Res. (2018) 18:1–52. Available at: https://jmlr.csail.mit.edu/papers/v18/16-558.html.

Google Scholar

26. Ginestet, C. ggplot2: elegant graphics for data analysis. J R Stat Soc Ser A Stat Soc. (2011) 174:245–6. doi: 10.1111/j.1467-985X.2010.00676_9.x

Crossref Full Text | Google Scholar

27. Mayer, M. shapviz: SHAP Visualizations. R package version 0.10.2, (2025). Available at: https://github.com/modeloriented/shapviz.

Google Scholar

28. Bajwa, J, Munir, U, Nori, A, and Williams, B. Artificial intelligence in healthcare: transforming the practice of medicine. Fut Healthc J. (2021) 8:e188–94. doi: 10.7861/fhj.2021-0095

PubMed Abstract | Crossref Full Text | Google Scholar

29. Liu, X, Xie, Z, Zhang, Y, Huang, J, Kuang, L, Li, X, et al. Machine learning for predicting in-hospital mortality in elderly patients with heart failure combined with hypertension: a multicenter retrospective study. Cardiovasc Diabetol. (2024) 23:407. doi: 10.1186/s12933-024-02503-9

PubMed Abstract | Crossref Full Text | Google Scholar

30. Wu, Y. Weighted McNemar's test for the comparison of two screening tests in the presence of verification bias. Stat Med. (2022) 41:3149–63. doi: 10.1002/sim.9409

PubMed Abstract | Crossref Full Text | Google Scholar

31. Quintero, L, Meléndez-Lugo, JJ, Palacios-Rodríguez, HE, Caicedo, Y, Padilla, N, Gallego, LM, et al. Damage control in the emergency department, a bridge to life. Colomb Med. (2021) 52:e4004801. doi: 10.25100/cm.v52i2.4801

PubMed Abstract | Crossref Full Text | Google Scholar

32. Chung, CY, and Scalea, TM. Damage control surgery: old concepts and new indications. Curr Opin Crit Care. (2023) 29:666–73. doi: 10.1097/MCC.0000000000001097

PubMed Abstract | Crossref Full Text | Google Scholar

33. Guan, G, Lee, CMY, Begg, S, Crombie, A, and Mnatzaganian, G. The use of early warning system scores in prehospital and emergency department settings to predict clinical deterioration: a systematic review and meta-analysis. PLoS One. (2022) 17:e0265559. doi: 10.1371/journal.pone.0265559

PubMed Abstract | Crossref Full Text | Google Scholar

34. Raita, Y, Goto, T, Faridi, MK, Brown, DFM, Camargo, CA Jr, and Hasegawa, K. Emergency department triage prediction of clinical outcomes using machine learning models. Crit Care. (2019) 23:64. doi: 10.1186/s13054-019-2351-7

PubMed Abstract | Crossref Full Text | Google Scholar

35. Jiang, D, Jiao, L, Li, Q, Xie, R, Jia, H, Wang, S, et al. Neutrophil-derived migrasomes are an essential part of the coagulation system. Nat Cell Biol. (2024) 26:1110–23. doi: 10.1038/s41556-024-01440-9

PubMed Abstract | Crossref Full Text | Google Scholar

36. Li, F, Li, L, Peng, R, Liu, C, Liu, X, Liu, Y, et al. Brain-derived extracellular vesicles mediate systemic coagulopathy and inflammation after traumatic brain injury. Int Immunopharmacol. (2024) 130:111674. doi: 10.1016/j.intimp.2024.111674

PubMed Abstract | Crossref Full Text | Google Scholar

37. Wu, Y, Liu, W, Zhou, Y, Hilton, T, Zhao, Z, Liu, W, et al. von Willebrand factor enhances microvesicle-induced vascular leakage and coagulopathy in mice with traumatic brain injury. Blood. (2018) 132:1075–84. doi: 10.1182/blood-2018-03-841932

PubMed Abstract | Crossref Full Text | Google Scholar

38. Hubbard, WB, Dong, JF, Cruz, MA, and Rumbaut, RE. Links between thrombosis and inflammation in traumatic brain injury. Thromb Res. (2021) 198:62–71. doi: 10.1016/j.thromres.2020.10.041

PubMed Abstract | Crossref Full Text | Google Scholar

39. Maegele, M, Schöchl, H, and Cohen, MJ. An update on the coagulopathy of trauma. Shock. (2014) 41:21–5. doi: 10.1097/SHK.0000000000000088

PubMed Abstract | Crossref Full Text | Google Scholar

40. Albert, V, Arulselvi, S, Agrawal, D, Pati, HP, and Pandey, RM. Early posttraumatic changes in coagulation and fibrinolysis systems in isolated severe traumatic brain injury patients and its influence on immediate outcome. Hematol Oncol Stem Cell Ther. (2019) 12:32–43. doi: 10.1016/j.hemonc.2018.09.005

PubMed Abstract | Crossref Full Text | Google Scholar

41. Neuberger, J, Patel, J, Caldwell, H, Davies, S, Hebditch, V, Hollywood, C, et al. Guidelines on the use of liver biopsy in clinical practice from the British Society of Gastroenterology, the Royal College of Radiologists and the Royal College of pathology. Gut. (2020) 69:1382–403. doi: 10.1136/gutjnl-2020-321299

PubMed Abstract | Crossref Full Text | Google Scholar

42. Chun, HS, Choe, AR, Lee, M, Cho, Y, Kim, HY, Yoo, K, et al. Treatment of direct oral anticoagulants in patients with liver cirrhosis and portal vein thrombosis. Clin Mol Hepatol. (2021) 27:535–52. doi: 10.3350/cmh.2021.0109

PubMed Abstract | Crossref Full Text | Google Scholar

43. Tsai, CH, Rau, CS, Chou, SE, Su, WT, Hsu, SY, and Hsieh, CH. Delta De Ritis ratio is associated with worse mortality outcomes in adult trauma patients with moderate-to-severe traumatic brain injuries. Diagnostics. (2022) 12:3004. doi: 10.3390/diagnostics12123004

PubMed Abstract | Crossref Full Text | Google Scholar

44. Prisco, L, Iscra, F, Ganau, M, and Berlot, G. Early predictive factors on mortality in head injured patients: a retrospective analysis of 112 traumatic brain injured patients. J Neurosurg Sci. (2012) 56:131–6. Available at: https://www.minervamedica.it/en/journals/neurosurgical-sciences/article.php?cod=R38Y2012N02A0131.

PubMed Abstract | Google Scholar

45. Depreitere, B, Becker, C, Ganau, M, Gardner, RC, Younsi, A, Lagares, A, et al. Unique considerations in the assessment and management of traumatic brain injury in older adults. Lancet Neurol. (2025) 24:152–65. doi: 10.1016/S1474-4422(24)00454-X

PubMed Abstract | Crossref Full Text | Google Scholar

46. Peiniger, S, Nienaber, U, Lefering, R, Braun, M, Wafaisade, A, Borgman, MA, et al. Glasgow coma scale as a predictor for hemocoagulative disorders after blunt pediatric traumatic brain injury. Pediatr Crit Care Med. (2012) 13:455–60. doi: 10.1097/PCC.0b013e31823893c5

PubMed Abstract | Crossref Full Text | Google Scholar

47. Alimohammadi, E, Foroushani, AZ, Moradi, F, Ebrahimzadeh, K, Nadersepahi, MJ, Amiri, A, et al. Response to the letter to the editor neutrophil-to-lymphocyte ratio could be used for early prediction of coagulopathy occurrence in traumatic brain injury. Injury. (2022) 53:3558–9. doi: 10.1016/j.injury.2022.08.022

PubMed Abstract | Crossref Full Text | Google Scholar

48. Mukerji, S, Darwin, S, Suchdev, K, Levine, A, Xu, L, Daneshmand, A, et al. Persistent coagulopathy after gunshot traumatic brain injury: the importance of INR and the SPIN score. Eur J Trauma Emerg Surg. (2022) 48:4813–22. doi: 10.1007/s00068-022-02009-7

PubMed Abstract | Crossref Full Text | Google Scholar

49. de Oliveira Manoel, AL, Neto, AC, Veigas, PV, and Rizoli, S. Traumatic brain injury associated coagulopathy. Neurocrit Care. (2015) 22:34–44. doi: 10.1007/s12028-014-0026-4

PubMed Abstract | Crossref Full Text | Google Scholar

50. Christoffel, J, and Maegele, M. Guidelines in trauma-related bleeding and coagulopathy: an update. Curr Opin Anaesthesiol. (2024) 37:110–6. doi: 10.1097/ACO.0000000000001346

PubMed Abstract | Crossref Full Text | Google Scholar

51. Ahmad, R, Maghrabi, LA, Khaja, IA, Maghrabi, LA, and Ahmad, M. SMOTE-based automated PCOS prediction using lightweight deep learning models. Diagnostics. (2024) 14:225. doi: 10.3390/diagnostics14192225

PubMed Abstract | Crossref Full Text | Google Scholar

52. Kosolwattana, T, Liu, C, Hu, R, Han, S, Chen, H, and Lin, Y. A self-inspected adaptive SMOTE algorithm (SASMOTE) for highly imbalanced data classification in healthcare. BioData Min. (2023) 16:15. doi: 10.1186/s13040-023-00330-4

PubMed Abstract | Crossref Full Text | Google Scholar

53. Qi, X, Wang, S, Fang, C, Jia, J, Lin, L, and Yuan, T. Machine learning and SHAP value interpretation for predicting comorbidity of cardiovascular disease and cancer with dietary antioxidants. Redox Biol. (2025) 79:103470. doi: 10.1016/j.redox.2024.103470

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: traumatic brain injury, coagulopathy, machine learning, emergency medicine, feature importance, SHAP analysis, risk stratification

Citation: Wang H, Cao W, Huang J, Feng Y and Li C (2025) Machine learning models predict coagulopathy in traumatic brain injury patients in ER. Front. Neurol. 16:1649869. doi: 10.3389/fneur.2025.1649869

Received: 19 June 2025; Accepted: 02 September 2025;
Published: 18 September 2025.

Edited by:

Jin Lu, Shanghai Jiao Tong University, China

Reviewed by:

Mario Ganau, Oxford University Hospitals NHS Trust, United Kingdom
Vitor Nespoli, Rush University Medical Center, United States

Copyright © 2025 Wang, Cao, Huang, Feng and Li. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Jianhuang Huang, dGVhbWh1YW5nQHNpbmEuY29t; Yuxing Feng, ZmVuZ3l1eGluZy0xMUAxNjMuY29t; Cheng Li, MzY0Nzk5Njg0QHFxLmNvbQ==

^†These authors share first authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.