Skip to main content

ORIGINAL RESEARCH article

Front. Med., 03 December 2021
Sec. Intensive Care Medicine and Anesthesiology
Volume 8 - 2021 | https://doi.org/10.3389/fmed.2021.775047

An Interpretable Early Dynamic Sequential Predictor for Sepsis-Induced Coagulopathy Progression in the Real-World Using Machine Learning

Ruixia Cui1,2 Wenbo Hua3 Kai Qu1 Heran Yang3 Yingmu Tong1,2 Qinglin Li1,2 Hai Wang1,2 Yanfen Ma4 Sinan Liu1,2 Ting Lin1,2 Jingyao Zhang1,2,5* Jian Sun3* Chang Liu1,2,5*
  • 1Department of Hepatobiliary Surgery, The First Affiliated Hospital of Xi'an Jiaotong University, Xi'an, China
  • 2Department of SICU, The First Affiliated Hospital of Xi'an Jiaotong University, Xi'an, China
  • 3School of Mathematics and Statistics, Xi'an Jiaotong University, Xi'an, China
  • 4Department of Clinical Laboratory, The First Affiliated Hospital of Xi'an Jiaotong University, Xi'an, China
  • 5Biobank, The First Affiliated Hospital of Xi'an Jiaotong University, Xi'an, China

Sepsis-associated coagulation dysfunction greatly increases the mortality of sepsis. Irregular clinical time-series data remains a major challenge for AI medical applications. To early detect and manage sepsis-induced coagulopathy (SIC) and sepsis-associated disseminated intravascular coagulation (DIC), we developed an interpretable real-time sequential warning model toward real-world irregular data. Eight machine learning models including novel algorithms were devised to detect SIC and sepsis-associated DIC 8n (1 ≤ n ≤ 6) hours prior to its onset. Models were developed on Xi'an Jiaotong University Medical College (XJTUMC) and verified on Beth Israel Deaconess Medical Center (BIDMC). A total of 12,154 SIC and 7,878 International Society on Thrombosis and Haemostasis (ISTH) overt-DIC labels were annotated according to the SIC and ISTH overt-DIC scoring systems in train set. The area under the receiver operating characteristic curve (AUROC) were used as model evaluation metrics. The eXtreme Gradient Boosting (XGBoost) model can predict SIC and sepsis-associated DIC events up to 48 h earlier with an AUROC of 0.929 and 0.910, respectively, and even reached 0.973 and 0.955 at 8 h earlier, achieving the highest performance to date. The novel ODE-RNN model achieved continuous prediction at arbitrary time points, and with an AUROC of 0.962 and 0.936 for SIC and DIC predicted 8 h earlier, respectively. In conclusion, our model can predict the sepsis-associated SIC and DIC onset up to 48 h in advance, which helps maximize the time window for early management by physicians.

Introduction

Sepsis is a lethal disease caused by a dysregulated host response in an infected state (1). Septic-induced organ dysfunction is a major cause of sepsis high mortality (2). Among these, coagulation dysfunction is a pervasive complication of sepsis, occurring in 50–70% of sepsis patients, while approximately 35% of patients proceed to disseminated intravascular coagulation (DIC) (3). Sepsis-induced coagulopathy (SIC) mortality reaches to 23.1% (4), while the mortality rate of sepsis-associated DIC is more than twice that of simple sepsis patients (5, 6). According to the SIC scoring system proposed by the DIC subcommittee of International Society on Thrombosis and Haemostasis (ISTH) in 2017, sepsis patients easily meet the SIC diagnostic criteria (7). Sepsis-associated DIC was diagnosed by the two-step sequential approach of SIC and ISTH overt-DIC criteria, which is a late-phase coagulation disorder that should be detected early (8). Currently, DIC diagnosis does not have a gold standard. Physicians diagnose DIC according to the primary disease, clinical manifestations, and laboratory tests. However, the clinical signs and symptoms of DIC appear slowly and the manifestations are complex and varied, resulting in a time lag in the clinical diagnosis of DIC, which places the patient in a treatment-refractory phase when sepsis-associated DIC is clinically determined (9). In addition, studies have shown that anticoagulation is ineffective in both sepsis and SIC patients, but effective in sepsis-induced DIC patients (1012). Hence, early recognition of sepsis-associated DIC is more important than SIC, while there are currently no studies on sequential prediction of sepsis-associated DIC after SIC alerts. Therefore, it is imperative to establish a model for early sequential real-time prediction of SIC and sepsis-associated DIC.

The prevalence of electronic health records (EHRs) and the upsurge of artificial intelligence (AI) provide opportunities for clinical medical research (13). Studies have shown that machine learning-based models outperform traditional clinical scoring and human expert systems in the diagnosis, treatment, and prognosis prediction of clinical diseases (14, 15). However, current clinical prediction studies are mainly static and lack real-time prediction studies. Real-time prediction models dynamically predict the onset of disease within a sliding time-window by continuously updating clinical data. From the clinical dynamic treatment perspective, real-time predictive models would better fit the clinical applications (16). In addition, the variability of primary diseases, comorbidities and severity of conditions in ICU patients leads to sparse and irregular clinical data in terms of sampling time and dimensions (17). To accommodate irregular time series data, the existing standard models such as eXtreme Gradient Boosting (XGBoost) (18) and Recurrent Neural Network (RNN) (19) decompose time into continuous, non-overlapping uniform intervals, known as temporal discretization (20). This enables the standard models to act on fixed dimensional vectors (regular data). However, this approach lacks the continuity principle and can lead to undesirable results when applied to irregular medical time-series data (21). Altogether, it is necessary to develop a model that is specifically designed to handle sparse irregular time series data in the real clinical world to achieve real-time accurate predictions at arbitrary time points.

In summary, we aim to help physicians to identify patients at high risk of SIC and sepsis-associated DIC early, especially those who progress to DIC after SIC, as well as improve existing machine learning models to enable arbitrary time-point prediction on real-world irregular data. To achieve this, an interpretable early real-time sequential warning predictor will be developed that contributes to early personalized treatment and reasonable administration. The overview of the study design and model development was shown in Figure 1.

FIGURE 1
www.frontiersin.org

Figure 1. Study design and model development overview. (A) Research application design profile. Study developed an early real-time prediction model to dynamically predict disease progression of sepsis patients after admission to ICU, and to output whether the patient will suffer from SIC and sepsis-associated DIC and the risk score for their onset in future periods. The model feeds into the information obtained from a continuous 24 h sampling time-window and outputs the disease status for the next 8n h in real-time. The x-axis indicates the time since admission. (B) Model development overview. The research process comprises to data processing, model development, model evaluation, and interpretation. The data processing process includes raw data extraction, outlier processing, coagulation status annotation, and feature engineering. The model development stage included feature set split, sampling time-window selection pre-experimental, model construction, and training. We built eight models, trained and selected the Full model with good performance on the internal validation set by SGD or Adam optimization. The Lite models were developed by selecting easily accessible and important features through SHAP and Occlusion analysis. Models were validated at BIDMC for external validation. The model evaluation and interpretation section include the assessment of model predictive performance, interpretation of model outputs, individualized disease trajectory prediction, etc.

Methods

Study Cohort and Design

This is a multicenter retrospective cohort study. Research data were obtained from two medical centers, the Xi'an Jiaotong University Medical College (XJTUMC) and Beth Israel Deaconess Medical Center (BIDMC). Structured query language (SQL) was used to obtain eligible patient data for the period from January 1, 2013 to December 1, 2018 in XJTUMC and from 2001 to 2012 in BIDMC, respectively. The XJTUMC data were obtained from the Center's Biobank and the BIDMC data were obtained from the Medical Information Mark for Intensive Care (MIMIC-III) database (22). The study was reviewed by the Ethics Committee of the First Affiliated Hospital of Xi'an Jiaotong University, and all data were deidentified.

The enrollment process was shown in Figure 2. The inclusion criteria were as follows: (1) sepsis was diagnosed within 24 h of admission based on Sepsis 3.0 criteria; (2) the age was not <18 years; and (3) the duration of hospitalization was not <3 days. The exclusion criteria were as follows: (1) patients with DIC onset within 24 h of admission; (2) patients were affected by hematologic tumors (leukemia, lymphoma, etc.); (3) patients suffering from cirrhosis, acute liver failure, with liver function up to Child C; (4) patients treated with radiotherapy or chemotherapy; (5) patients with admission diagnosis of combat trauma, traumatic coagulopathy; (6) patients with pregnancy or perinatal complications.

FIGURE 2
www.frontiersin.org

Figure 2. The diagram of study patient enrollment. In XJTUMC, coagulation status annotation was based on SIC and ISTH-overt DIC criteria. In BIDMC, DIC were labeled based on ICD-9. ICD, International Classification of Diseases.

Data Preprocessing

We initially obtained a total of 174 laboratory features in XJTUMC and 259 laboratory features in BIDMC. Subsequently, we performed the process of merging identical variables (e.g., HGB with different units of g/L or g/dl, HGB in blood count and blood gas test, etc.), eliminating irrelevant variables (e.g., hepatitis antibody quantification, blood drug concentration, etc.), and counting the frequency of variable detection. We removed indictors that were completely missing and detected <1% of indicators at all-time points. In addition, we performed the unit conversions for the XJTUMC laboratory test variables, in order to maintain consistency with BIDMC. Ultimately, under the guidance of the laboratory physicians, we identified 99 features at XJTUMC and 72 features at BIDMC. Of these, the 72 features at BIDMC were common features for both medical centers. The missing information for both medical centers were shown in the Supplementary Figure 1 (Supplementary File 1). And the clinical reportable ranges for each identified variable was shown in Supplementary File 2. All variables were initialized by the min-max normalization algorithm.

Clinical sequence data are sparse and irregular, manifested by the presence of a large amount of missing data. Whereas, the pattern of missing data contains important information, such as the correlation between the frequency of a certain marker test and the severity of the disease. Therefore, in this study, we do not deal with missing data. Instead, we use the modeling of missing information to identify the role of missing patterns in prediction, which enhances the model prediction effectiveness. In brief, it is modeled by learning to characterize the missing and hidden information from the time-series data, which is then further introduced into the model network. The multivariate time series X = {x1, x2, …, xT} is the observations at time T. xtRDrepresents the observations at time t for all variables. xt contains D features {xt1,xt2,xtD} and xtd represents the d-th feature variable of xt. st denotes the timestamp of observation xt. We assume that the timestamp of the first observation is 0 (st = 0), the time interval between different timestamps may be the same or different. Δ indicates adjacent time steps for each variable. To provide an efficient representation of the missing values, we introduce the mask vector mtd={0, 1} to represent the missing variables in xt at time t. Some features are missing continuously over a period of time, and δtd is defined to represent the time interval between the last observation and the current timestamp. To be more specific, we have:

mtd ={1,if xtd is observed0,otherwise    (1)
δtd ={stst1+δt1dt>1, mt1d=0stst1t>1, mt1d=10t=1    (2)

Thus, a dynamic feature of the input would be represented as Xt=(xtd,mtd,Δ,δtd). In the later section, the missing information will be introduced into the model for subsequent processing when improving the algorithm model.

Ground-Truth Labels Using ISTH

We defined all three disease states according to SIC and ISTH overt-DIC criteria. The details of the disease status annotation criteria can be found in Supplementary Table 1 (Supplementary File 1). New data were not available for all 8 h time-window of the day because the patient's laboratory tests were irregular. If there are no updated data available for labeling in an 8 h time-window, then the forward interpolation method is used for labeling based on the labels before and after that time-window. More specific details were shown in Supplementary Figure 2 (Supplementary File 1).

Continuous Models for Irregular Time Series

The study compared two methods for dealing with irregular time series. The first is the temporal discretization approach, in which the standard RNN model is a typical model. In the second approach, we tackle the irregular time-series problem by modeling the missing information. Based on the standard RNN model, we introduced a decay mechanism for modeling missing information by referencing Che et al.'s study (21), to eventually develop the RNN-Decay model. In addition, we developed the Ordinary Differential Equations-Recurrent Neural Networks (ODE-RNN) model by directly modeling the unequally spaced raw data referring to the work of Yulia et al. (23). The model architecture diagrams were shown in Figure 3. The model inputs are derived from the extraction of missing patterns of time series information from “Data Processing” described in the above section.

FIGURE 3
www.frontiersin.org

Figure 3. Architecture diagram of different RNN-based models for processing irregular time-series data. (A) Diagram of the standard RNN model processing irregular time-series data. The standard RNN transmits the hidden layer information forward at a fixed time interval without processing the hidden layer information; (B) Diagram of RNN-Decay model processing irregular time-series data. Based on the RNN framework, RNN-decay adds temporal decay derived from time intervals to the latent space in order to indirectly process irregular series; however, the longer the time interval, the weaker the information conveyed. (C) Diagram of ODE-RNN model processing irregular time-series data. The ODE-RNN directly modeled the unequally spaced raw data, and propagated a latent variable along the time interval using the Neural ODE to solve the latent layer information that corresponds to the irregular timestamp; this allows the arbitrary time-point data to be processed and guarantees temporal continuity. The time series T = {t0, t1, …, tN} is the observation timestamps, X = {x0, x1, …, xN} represents the observations at time t for all variables, h(t) is the latent space state corresponding to timestamp t. Orange circles indicate an RNN unit. Blue and black dashed lines with arrows indicate the shallow and latent features of the input RNN, and orange dashed lines with arrows indicate data from the previous RNN unit that has been processed in a different way and then fed into the next RNN unit.

Model Development

To select the appropriate sampling time length, we chose 8, 24, and 48 h sampling time-window to perform pre-experiments. The results were presented in the Supplementary Table 2 (Supplementary File 1). Finally, we set the sampling time-window to 24 h and the sliding time-step to 8 h. Then, we developed several state-of-the-art models that are widely used as follows: (1) Classic machine learning models: Logistic regression (LR) (24) and support vector machines (SVM) (25), which are the most commonly used algorithms in existing research; (2) Enhanced machine learning models: gradient boosting machine (LightGBM) (26) and XGBoost (27), which are widely regarded as the best algorithm for data prediction and are adopted by many competition winning models in the field of machine learning; (3) Classic deep learning models: RNN (21) and long short-term memory network (LSTM) (28), which are the most commonly chosen deep learning models in time-series data, which have shown excellent performance in several time series studies; (4) Improved deep learning models: RNN-Decay and ODE-RNN. The detailed method was described in Supplementary File 1. After evaluating the performance, we finally chose the highest-performing XGBoost model as our predictor.

Model Evaluation

We random divided the XJTUMC data into a train set (70%), a valid set (10%), and a test set (20%). The BIDMC data ware used as the external validation set to evaluate generalization ability of the model. Parameter optimization were performed by Stochastic Gradient Descent (SGD) or the Adam algorithm. The area under the receiver operating characteristic curve (AUROC), the area under the precision-recall curve (AUPRC) and the F1-score were used as model evaluation metrics. Test samples were resampled 1,000 times using bootstrapping to calculate 95% confidence intervals.

Model Interpretation

In this study, we interpreted the machine learning model results using the Shapley Additive exPlanations (SHAP) algorithm (29) and the deep learning model results using Occlusion analysis (30).

Statistical Analysis

Baseline data are skewed and expressed as the median and interquartile range (IQR). Non-parametric tests were used for statistical tests. P-values < 0.001 were considered statistically significant. Navicat Premium (12.1.22), Pytorch (1.7.0), and Python (3.7.6) with Numpy (1.18.5), Pandas (1.1.5), and Scikit-learn (0.23.2) formed the data-processing pipeline. All computational analyses were performed in the Computer Center of the School of Mathematics, Xi'an Jiaotong University.

Results

Study Baseline

A total of 9,717 infection patients in XJTUMC and 15,059 sepsis patients in BIDMC were initially included when applying ICD-9 codes and sepsis 3.0 criteria. After applying the exclusion criteria, 3,809 and 11, 926 patients were left. We then annotated the coagulation status of 3,809 XJTUMC sepsis patients by SIC and ISTH overt-DIC criteria, a total of 12,154 SIC status labels (positive: 8,909; negative: 3,246); and 7,878 overt-DIC status labels (positive: 3,051; negative: 4,827); were available in XJTUMC. Also, we selected 296 patients (1,210 status labels) who developed DIC during hospitalization at the BIDMC center using ICD-9 codes. The baseline characteristics of included patients at XJTUMC and BIDMC were shown in Table 1. For both XJTUMC and BIDMC, the median age of sepsis patients is above 60 years and the predominant cause of sepsis was respiratory system-derived infections. Also, 652/1,415 (46.1%) of SIC patients developed to DIC, while 652/679 (96%) of DIC patients fulfilled the SIC diagnosis.

TABLE 1
www.frontiersin.org

Table 1. Baseline characteristics of included patients at XJTUMC and BIDMC.

Full Model Performance

The predictive performance of eight different models for early DIC onset were shown in Figure 4. Figures 4A,B showed that XGBoost produced the best prediction performance (AUROC: 0.955; AUPRC: 0.939) and was validated in LightGBM, followed by ODE-RNN (AUROC: 0.936; AUPRC: 0.902). However, ODE-RNN ensures continuity of model prediction, which is more suitable for clinical applications than XGBoost. Figure 4C revealed that the performance of the XGBoost on the BIDMC external validation set has decreased (AUROC: 0.865). Figure 4D provided prediction performance of the model at different prediction time-window, revealing that our model could detect the event as early as 48 h before the ground-truth. Figure 5 illustrated the prediction performance for SIC, showing similar results to DIC. Figures 5A,B showed that XGBoost provided the best prediction performance (AUROC: 0.973; AUPRC: 0.979). Figure 5C revealed that the performance of the XGBoost on the BIDMC external validation set has decreased (AUROC: 0.973). Figure 5D revealed that XGBoost could detect the event as early as 48 h before the ground-truth with AUROC reached 0.929. The detailed predictive performance of models at different prediction time-window was shown in Table 2. The model prediction performance decreases steadily as the prediction time-window extends. The XGBoost and ODE-RNN still maintain good prediction performance for SIC and DIC in the 48 h ahead of prediction time-window. Furthermore, we examined the early warning performance of the models on SIC and DIC with different alert thresholds at the 8 h prediction time-window, the results were shown in Supplementary Tables 3, 4 (Supplementary File 1). That allows clinicians to select different thresholds according to the characteristics of the different stages of disease development and treatment needs.

FIGURE 4
www.frontiersin.org

Figure 4. Full model performance evaluation. (A) Receiver operating characteristic curves of models for DIC prediction on the internal test set. (B) Precision-recall curve of candidate models for DIC prediction on the internal test set. (C) Receiver operating characteristic curves on the external test set. (D) The predictive performance of models at 8, 24, and 48 h ahead of sepsis-associated DIC onset. Precision was defined as the fraction of correctly alerted 8 h before the ground-truth event. Recall was defined as the proportion of any alarms 8 h prior to the ground-truth event. The lighter-colored intervals above and below the ROC and PR curves are 95% confidence intervals.

FIGURE 5
www.frontiersin.org

Figure 5. Full model performance evaluation for SIC prediction. (A) Receiver operating characteristic curves of models for SIC prediction on the internal test set. (B) Precision-recall curve of candidate models for SIC prediction on the internal test set. (C) Receiver operating characteristic curves on the external test set. (D) The predictive performance of models at 8, 24, and 48 h ahead of SIC onset.

TABLE 2
www.frontiersin.org

Table 2. Model performance at different prediction time-window for SIC and DIC prediction.

Model Interpretation

To understand the contribution of the features to the model predictions, we interpreted the XGBoost predictions using Shapley values, which were presented in Figure 6. Figure 6A showed the top 20 features that contribute most to the model output; Figure 6B showed the impact of the top 20 features on all samples in the model. Shaply analysis identified that the most valuable features for DIC prediction were platelet (PLT), D-dimer, International normalized ratio (INR), plateletcrit (PCT), fibrinogen (FIB), fibrin degradation products (FDP). We further developed dependency plots to capture the non-linear correlation between a single significant feature and the predicted risk. As an example, Figure 6C showed that when PLT was below 80, the shape-value was significantly increased with higher predicted risk. Figure 6D showed the interaction between PLT and PCT, where when PCT is low, the corresponding PLT feature value is low, SHAP takes a high value and the model output risk increases. In addition, the SHAP force plot provides insight into the output risk and decision factors for specific samples. In the case of Figure 6E, the model predicts the sample at a high risk of a DIC event based on PT, INR, and PLT.

FIGURE 6
www.frontiersin.org

Figure 6. SHAP-based interpretation of model. (A) Overall feature importance of the top 20 features. (B) Beeswarm plot showing the impact of the top 20 features across all samples in the model. Beeswarm combines feature importance and feature effect, ranking the features by the sum of the SHAP over all samples (y-axis). In the plot, one row represents one feature and each dot represents the feature SHapley value for one sample, gray dots represent missing values, colors represent feature values (red high, blue low). Long tails indicate that features are extremely important for a particular patient. The x-axis represents the impact on the model output, with positive values pushing risk higher and negative values driving risk lower. (C) SHAP dependence plot showing predicted risk vs. feature value for PLT. The x-axis is the range of eigenvalues of PLT features and the y-axis is the shape-value of PLT features. It showed that when PLT was below 80, the shape-value was significantly increased with higher predicted risk. (D) SHAP interaction dependency plot describes the relationship between two interaction features and predicted risk. It showed the interaction between PLT and PCT, where when PCT is low, the corresponding PLT feature value is low, SHAP takes a high value and the model output risk increases. (E) SHAP force plot shows an interpretable example of single-sample feature prediction results. The base value is the predicted average of the all sample, and the output value is the predicted value of the current sample. Red indicates that the feature increases the risk, blue indicates that the feature decreases the risk. Longer feature bars indicate that the feature is contributing more.

In addition, we interpreted the deep learning model ODE-RNN using occlusion analysis, as shown in Supplementary File 3. Occlusion analysis showed that a global absence of a single feature has small impact on the model output results for the ODE-RNN model. We also performed univariate and multivariate analyses of baseline data on the first day for different coagulation status groups, as detailed in Supplementary Tables 5, 6 (Supplementary File 1), suggesting the reliability of the machine learning approach.

Figure 7 showed an example of real-time sequential prediction using our model on one patient. At each time point after the patient admission over 24 h, the model provides a real-time risk and uncertainty assessment of the future SIC and sepsis-associated DIC onset. This showed that the model could detect SIC and sepsis-associated DIC 48 h early, which is important for clinicians to take precautionary approaches ahead of the event.

FIGURE 7
www.frontiersin.org

Figure 7. An interpretable real-time risk forecasting example. A 69-year-old male patient was hospitalized with the diagnosis of “Acute biliary pancreatitis, Obstructive jaundice.” The patient was diagnosed with SIC on the third day and sepsis-associated DIC on the fifth day of admission by the clinician. Our model predicted the disease progression earlier than the clinician. The predictions revealed that the most of sepsis-associated DIC events could be accurately predicted up to 8 h earlier. However, there were some sepsis-associated DIC events were still not predicted, mainly in the persistence of DIC onset, which might be related to clinical medicine interventions. The bottom of the graph is the SIC risk score and the upper is the DIC risk score. Red circles indicate sepsis-associated DIC events evaluated by the model. Red stars indicate true future predictions (meeting the ISTH overt-DIC criteria).

Lite Model Development

To enhance the transferability and reduce the data requirements of the model, we selected the ten most influential features based on SHAP values, Occlusion analysis, and clinical practicability (The Lite model feature details were shown in Supplementary File 2). Based on the selected features, we constructed the Lite model. Figure 8 indicated that the XGBoost-based Lite model achieves the best performance of predicting DIC 8 h in advance, but the model performance is slightly lower than that of the Full model (AUROC: 0.916 vs. 0.955).

FIGURE 8
www.frontiersin.org

Figure 8. Lite model performance evaluation. (A) Receiver operating characteristic curves of XGBoost for DIC. (B) Receiver operating characteristic curves of ODE-RNN for DIC.

Discussion

In this study, we developed two models enabling real-time sequential monitoring of SIC and DIC disease progression in sepsis. The model could identify high-risk patients 48 h before the clinical diagnosis of SIC and sepsis-associated DIC, achieving the state-of-the-art retrospective performance. The full XGBoost model currently achieved the highest prediction performance (SIC: 0.973; DIC: 0.955). The Lite XGBoost model also achieved pleasing prediction performance (DIC: 0.916). On the BIDMC test set, the model performance decreased slightly (DIC: 0.865). In the 8 h prediction time-window with a threshold of 0.7, the XGBoost model was able to predict 89.5% of SIC and 83.1% of sepsis-associated DIC events correctly. Meanwhile, our study has introduced a methodological improvement. Specifically, our study provided the following contributions: (1) We developed and validated the first model for earlier sequential dynamic monitoring of sepsis-induced coagulation disease progression; (2) We processed irregular time-series data for the first time in dynamic prediction research using ODE method, achieving predictions at arbitrary time points; (3) We provided the visual interpretation for deep learning models and machine learning models, respectively, which improved the recognition of physicians toward complex models.

Sepsis-induced SIC, particularly sepsis-associated DIC, is a major cause of increased mortality in sepsis. However, clinical DIC diagnosis relies on FIB and D-Dimer whose laboratory testing frequency is <10% (31), resulting in a lag in DIC diagnosis. Therefore, it is meaningful to use a full spectrum of laboratory tests for the early detection of coagulation disorders. To our knowledge, there is only one study using machine learning to predict the progression of sepsis-induced DIC that was published in 2020 (32). Hasegawa et al. performed three classical machine learning methods to predict the progression of sepsis-induced coagulation disorders. In that study, sepsis was defined based on Systemic Inflammatory Response Syndrome (SIRS) criteria rather than sepsis 3.0 criteria. In addition, the study used the static data and the accuracy of the model to predict the progression of coagulopathy was only 59.8–67.0% (32). This is far poorer than our model prediction performance, which suggests that dynamic data monitoring is more consistent with clinical application than static models. In our study, high-risk patients were identified up to 48 h earlier, which suggests that comprehensive use of laboratory tests could detect coagulation disorders earlier. The Lite model also achieves satisfactory predictive performance.

The irregularity of clinical time series data was reflected in the tables as a large number of missing. Previous studies deal with large amounts of missing data by removing missing variables or using statistical interpolation methods, but such methods are not applicable to time series data (33). Neural ODE is a continuous dynamics theory that can explore the dynamic interactions between key features in the timeline of event onset and development (34). Our results demonstrate the superiority of ODE in the dynamics of disease. In a recent review, Alber et al. showed that it remains a challenge to apply ODE in medical continuous monitoring studies with incomplete baseline data and low-sampling data (35). Our study introduced ODE into RNNs and achieved better performance than RNN and LSTM, showing that ODE-RNNs are more appropriate for sparse irregular data than standard deep learning models. However, our results showed that the performance of the ODE-RNN was lower than that of the gradient boosted tree model (XGBoost and LightGBM). We consider that clinical laboratory diagnoses usually use hierarchical stratification for diagnosis, which fits better with the splitting structure of the tree model and gives the tree model a natural advantage. In addition, Qin et al. showed that neural models are not good at efficient feature transformation and scaling, while the tree-based model has an advantage in this respect (36). However, the flexibility and variety of tasks that can be achieved with deep learning are not available with traditional methods when dealing with complex tabular problems. Furthermore, occlusion analysis showed that a single feature masking does not affect the model significantly, indicating that ODE-RNN has better perturbation resistance. That is, the overall absence of a particular examination does not have a large impact on the ODE-RNN model, suggesting that the ODE-RNN model may have better robustness. Finally, the arbitrary time point continuous prediction achieved by ODE-RNN is also not possible with the gradient boosted tree model, where this arbitrary time point continuity is significant for clinical applications.

Our study offers the following potential benefits: Firstly, it is essential for ICU clinicians and nurses to identify patients who truly need intensive attention and personalized preventive medication. Our predictor can reduce the alarm frequency in patients without a high risk of sepsis-associated DIC occurrence after SIC. Secondly, our ODE-RNN model provides a reference for model selection of real-world irregular time series processing. This will facilitate subsequent studies to build robust models that better match the characteristics of clinical data. Thirdly, our model can be used to identify sepsis patients in different states of coagulation impairment, which could be useful for future randomized controlled clinical studies and further assist physicians to evaluate the time window of anticoagulation therapy appropriately. Finally, our interpretable model provides a visual interactive operating system prototype for early warning systems in ICU and will facilitate clinical deployment of predictive models.

However, some limitations also exist in our study. First of all, our model was developed in a single center, which reduces the effectiveness and may require retraining when the model is migrated to other centers. Furthermore, because we failed to obtain bedside real-time vital sign monitoring data, our model did not incorporate these parameters which may diminish the efficacy of the model. Finally, our study is retrospective and further prospective clinical studies need to be validated.

Conclusion

Our early dynamic sequential predictor enables identification of sepsis patients at high risk of SIC and DIC up to 48 h earlier, achieving the highest performance to date. Our study showed that the ODE-RNN model achieves better performance than the standard RNN model. Our study contributes to early personalized management, and also improves the currently available algorithms.

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics Statement

The studies involving human participants were reviewed and approved by XJTU1AF2020LSL-003. Written informed consent for participation was not required for this study in accordance with the national legislation and the institutional requirements.

Author Contributions

RC designed the experiments, provided and preprocessed the clinical data, developed the supervised learning pipelines, and drafted the manuscript. WH designed the experiments, developed the machine learning pipelines, constructed the deep learning pipelines, and drafted the manuscript. KQ preprocessed the data and carried out the multivariate regression analysis. HY annotated the disease status labels and developed the deep learning pipelines. YT, QL, and HW carried out the data filtering and univariate analysis. YM, SL, and TL defined the clinically reportable range and performed multivariate regression analysis. JZ, JS, and CL revised the manuscript, conceived, and directed the project. All authors read and approved the final manuscript.

Funding

This study was supported by the Critical Clinical Research Project of the First Affiliated Hospital of Xi'an Jiaotong University (No. XJTU1AF-CRF-2020-003) and the Joint Project of Universities in Shaanxi Province - Key Project (No. 2021GXLH-Z-099).

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher's Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Acknowledgments

We gratefully acknowledge the data support from the Biobank of First Affiliated Hospital of Xi'an Jiaotong University, and the MIMIC project from Massachusetts Institute of Technology and the Beth Israel Deaconess Medical Center. We also thank the helpful discussions with Yan Yang, Ph.D., and Dong Yang, Ph.D., School of Mathematics and Statistics, Xi'an Jiaotong University. We thank the technician Hui Liu who performed the data quality control, Biobank, the First Affiliated Hospital of Xi'an Jiaotong University. Finally, thanks to all reviewers for their time and suggestions.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fmed.2021.775047/full#supplementary-material

References

1. Coopersmith CM, De Backer D, Deutschman CS, Ferrer R, Lat I, Machado FR, et al. Surviving sepsis campaign: research priorities for sepsis and septic shock. Intensive Care Med. (2018) 44:1400–26. doi: 10.1007/s00134-018-5175-z

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Prescott HC, Angus DC. Enhancing recovery from sepsis: a review. JAMA. (2018) 319: 62–75. doi: 10.1001/jama.2017.17687

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Levi M, van der Poll T. Coagulation and sepsis. Thromb Res. (2017) 149:38–44. doi: 10.1016/j.thromres.2016.11.007

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Iba T, Umemura Y, Watanabe E, Wada T, Hayashida K, Kushimoto S, et al. Diagnosis of sepsis-induced disseminated intravascular coagulation and coagulopathy. Acute Med Surg. (2019) 6:223–32. doi: 10.1002/ams2.411

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Voves C, Wuillemin WA, Zeerleder S. International Society on Thrombosis and Haemostasis score for overt disseminated intravascular coagulation predicts organ dysfunction and fatality in sepsis patients. Blood Coagul Fibrin. (2006) 17:445–51. doi: 10.1097/01.mbc.0000240916.63521.2e

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Lyons PG, Micek ST, Hampton N, Kollef MH. Sepsis-associated coagulopathy severity predicts hospital mortality. Crit Care Med. (2018) 46:736–42. doi: 10.1097/CCM.0000000000002997

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Iba T, Nisio MD, Levy JH, Kitamura N, Thachil J. New criteria for sepsis-induced coagulopathy (SIC) following the revised sepsis definition: a retrospective analysis of a nationwide survey. BMJ Open. (2017) 7:e017046. doi: 10.1136/bmjopen-2017-017046

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Iba T, Arakawa M, Di Nisio M, Gando S, Anan H, Sato K, et al. Newly proposed sepsis-induced coagulopathy precedes international society on thrombosis and haemostasis overt-disseminated intravascular coagulation and predicts high mortality. J Intensive Care Med. (2020) 35:643–9. doi: 10.1177/0885066618773679

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Gando S, Levi M, Toh CH. Disseminated intravascular coagulation. Nat Rev Dis Primers. (2016) 2:16037. doi: 10.1038/nrdp.2016.37

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Umemura Y, Yamakawa K, Ogura H, Yuhara H, Fujimi S. Efficacy and safety of anticoagulant therapy in three specific populations with sepsis: a meta-analysis of randomized controlled trials. J Thromb Haemost. (2016) 14:518–30. doi: 10.1111/jth.13230

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Umemura Y, Yamakawa K. Optimal patient selection for anticoagulant therapy in sepsis: an evidence-based proposal from Japan. J Thromb Haemost. (2018) 16:462–4. doi: 10.1111/jth.13946

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Iba T, Levy JH, Warkentin TE, Thachil J, van der Poll T, Levi M. Diagnosis and management of sepsis-induced coagulopathy and disseminated intravascular coagulation. J Thromb Haemost. (2019) 17:1989–94. doi: 10.1111/jth.14578

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Ngiam KY, Khor IW. Big data and machine learning algorithms for health-care delivery. Lancet Oncol. (2019) 20:e262–73. doi: 10.1016/S1470-2045(19)30149-4

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Jiang F, Jiang Y, Zhi H, Dong Y, Li H, Ma S, et al. Artificial intelligence in healthcare: past, present and future. Stroke Vasc Neurol. (2017) 2:230–43. doi: 10.1136/svn-2017-000101

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Harerimana G, Kim JW, Yoo H, Jang B. Deep learning for electronic health records analytics. IEEE Access. (2019) 7:101245–59. doi: 10.1109/access.2019.2928363

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Fleuren LM, Klausch TLT, Zwager CL, Schoonmade LJ, Guo T, Roggeveen LF, et al. Machine learning for the prediction of sepsis: a systematic review and meta-analysis of diagnostic test accuracy. Intensive Care Med. (2020) 46:383–400. doi: 10.1007/s00134-019-05872-y

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Holzinger A, Jurisica I. Knowledge discovery and data mining in biomedical informatics: the future is in integrative, interactive machine learning solutions: In: Holzinger A, Jurisica I, editors. Interactive Knowledge Discovery and Data Mining in Biomedical Informatics. Lecture Notes in Computer Science, Vol. 8401. Berlin; Heidelberg: Springer (2014). p. 1–18. doi: 10.1007/978-3-662-43968-5_1

CrossRef Full Text | Google Scholar

18. Chen TQ, Guestrin C, Assoc Comp M. XGBoost: a scalable tree boosting system. In: Kdd'16: Proceedings of the 22nd Acm Sigkdd International Conference on Knowledge Discovery and Data Mining. San Francisco, CA (2016). p. 785–94. doi: 10.1145/2939672.2939785

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Wang J, Zhang L, Guo Q, Yi Z. Recurrent neural networks with auxiliary memory units. IEEE Trans Neural Netw Learn Syst. (2017) 29:1652–61. doi: 10.1109/TNNLS.2017.2677968

PubMed Abstract | CrossRef Full Text | Google Scholar

20. Futoma J, Hariharan S, Heller K. Learning to detect sepsis with a multitask gaussian process RNN classifier. In: Doina P, Yee Whye T, editors. Proceedings of the 34th International Conference on Machine Learning. Vol. 70. Proceedings of Machine Learning Research: PMLR. Sydney, NSW (2017). p. 1174–82.

Google Scholar

21. Che Z, Purushotham S, Cho K, Sontag D, Liu Y. Recurrent neural networks for multivariate time series with missing values. Sci Rep. (2018) 8:6085. doi: 10.1038/s41598-018-24271-9

PubMed Abstract | CrossRef Full Text | Google Scholar

22. Johnson AE, Pollard TJ, Shen L, Lehman LW, Feng M, Ghassemi M, et al. MIMIC-III, a freely accessible critical care database. Sci Data. (2016) 3:160035. doi: 10.1038/sdata.2016.35

PubMed Abstract | CrossRef Full Text | Google Scholar

23. Rubanova Y, Chen RTQ, Duvenaud D. Latent ODEs for irregularly-sampled time series. In: Advances in Neural Information Processing Systems 32 (Nips 2019). Vol. 32. Vancouver, BC (2019).

Google Scholar

24. Christensen R. Linear and log-linear models. J Am Stat Assoc. (2000) 95:1290–3. doi: 10.1080/01621459.2000.10474332

CrossRef Full Text | Google Scholar

25. Noble WS. What is a support vector machine? Nat Biotechnol. (2006) 24:1565–7. doi: 10.1038/nbt1206-1565

PubMed Abstract | CrossRef Full Text | Google Scholar

26. Ke GL, Meng Q, Finley T, Wang TF, Chen W, Ma WD, et al. LightGBM: a highly efficient gradient boosting decision tree. In: Guyon I, Luxburg UV, Bengio S, Wallach H, Fergus R, Vishwanathan S, Garnett R. Advances in Neural Information Processing Systems 30, Vol. 30. Long Beach, CA (2017).

Google Scholar

27. Sagi O, Rokach L. Approximating XGBoost with an interpretable decision tree. Inf Sci. (2021) 572:522–42. doi: 10.1016/j.ins.2021.05.055

CrossRef Full Text | Google Scholar

28. Yu Y, Si X, Hu C, Zhang J. A review of recurrent neural networks: LSTM cells and network architectures. Neural Comput. (2019) 31:1235–70. doi: 10.1162/neco_a_01199

PubMed Abstract | CrossRef Full Text | Google Scholar

29. Lundberg SM, Lee SI. A unified approach to interpreting model predictions. In: Advances in Neural Information Processing Systems 30 (Nips 2017). Vol. 30. Long Beach, CA (2017). p. 4768–777.

Google Scholar

30. Zeiler MD, Fergus R. Visualizing and understanding convolutional networks. In: European Conference on Computer Vision. Zurich: Springer (2014). p. 818–33. doi: 10.1007/978-3-319-10590-1_53

CrossRef Full Text | Google Scholar

31. Favaloro EJ, Negrini D. Machine learning and coagulation testing: the next big thing in hemostasis investigations? Clin Chem Lab Med. (2021) 59:1177–9. doi: 10.1515/cclm-2021-0216

PubMed Abstract | CrossRef Full Text | Google Scholar

32. Hasegawa D, Yamakawa K, Nishida K, Okada N, Murao S, Nishida O. Comparative analysis of three machine-learning techniques and conventional techniques for predicting sepsis-induced coagulopathy progression. J Clin Med. (2020) 9:2113. doi: 10.3390/jcm9072113

PubMed Abstract | CrossRef Full Text | Google Scholar

33. Shickel B, Tighe PJ, Bihorac A, Rashidi P. Deep EHR: a survey of recent advances in deep learning techniques for electronic health record (EHR) analysis. IEEE J Biomed Health Inform. (2017) 22:1589–604. doi: 10.1109/JBHI.2017.2767063

PubMed Abstract | CrossRef Full Text | Google Scholar

34. Chen RTQ, Rubanova Y, Bettencourt J, Duvenaud D. Neural ordinary differential equations. In: Advances in Neural Information Processing Systems 31 (Nips 2018). Vol. 31. Montréal, QC (2018).

Google Scholar

35. Alber M, Buganza Tepole A, Cannon WR De S, Dura-Bernal S, Garikipati K, et al. Integrating machine learning and multiscale modeling-perspectives, challenges, and opportunities in the biological, biomedical, and behavioral sciences. NPJ Digit Med. (2019) 2:115. doi: 10.1038/s41746-019-0193-y

PubMed Abstract | CrossRef Full Text | Google Scholar

36. Qin Z, Yan L, Zhuang H, Tay Y, Pasumarthi RK, Wang X, et al. Are neural rankers still outperformed by gradient boosted decision trees? In: International Conference on Learning Representations (ICLR 2021). Vienna (2021).

Google Scholar

Keywords: SIC, sepsis-associated DIC, irregular time-series data, early real-time prediction, machine learning

Citation: Cui R, Hua W, Qu K, Yang H, Tong Y, Li Q, Wang H, Ma Y, Liu S, Lin T, Zhang J, Sun J and Liu C (2021) An Interpretable Early Dynamic Sequential Predictor for Sepsis-Induced Coagulopathy Progression in the Real-World Using Machine Learning. Front. Med. 8:775047. doi: 10.3389/fmed.2021.775047

Received: 13 September 2021; Accepted: 08 November 2021;
Published: 03 December 2021.

Edited by:

Zhongheng Zhang, Sir Run Run Shaw Hospital, China

Reviewed by:

Jesus Rico-Feijoo, Hospital Universitario Río Hortega, Spain
Takeshi Wada, Hokkaido University, Japan

Copyright © 2021 Cui, Hua, Qu, Yang, Tong, Li, Wang, Ma, Liu, Lin, Zhang, Sun and Liu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Jingyao Zhang, jingyaozhang@xjtu.edu.cn; Jian Sun, jiansun@xjtu.edu.cn; Chang Liu, liuchangfh@xjtu.edu.cn

These authors have contributed equally to this work

Download