Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Immunol., 26 November 2025

Sec. Autoimmune and Autoinflammatory Disorders: Autoinflammatory Disorders

Volume 16 - 2025 | https://doi.org/10.3389/fimmu.2025.1629600

A machine learning algorithm to predict treatment effectiveness for Kawasaki disease in China: a retrospective model development and validation study

Xuemei Li&#x;Xuemei Li1†Zihan Zhou&#x;Zihan Zhou1†Jingyi FanJingyi Fan1Lin ZhaoLin Zhao1Ruidi XuRuidi Xu1Dong LiDong Li1Xu MaXu Ma1Lu SunLu Sun2Yujian WuYujian Wu2Zhouping Wang*&#x;Zhouping Wang2*‡Ce Wang*&#x;Ce Wang1*‡
  • 1Department of Pediatrics, Shengjing Hospital of China Medical University, Shenyang, Liaoning, China
  • 2Department of Pediatric Cardiology, Guangzhou Women and Children’s Medical Center, Guangzhou Medical University, Guangzhou, Guangdong, China

Background: Kawasaki disease (KD) is the primary cause of acquired heart disease in children. Intravenous immunoglobulin (IVIG) is the first-line therapy for KD; however, IVIG resistance can occur. Reliable treatment efficacy prediction tools for Chinese patients are lacking, which this study aimed to address.

Methods: This retrospective cohort study enrolled patients diagnosed with KD admitted to Shengjing Hospital of China Medical University and collected data on 36 demographic, clinical, and laboratory parameters. Least Absolute Shrinkage and Selection Operator (LASSO) regression was used to identify key predictive variables. The dataset was divided into training (70%) and validation (30%) sets. Ten models were trained through 10-fold cross-validation, and the training set data were balanced using the ROSE method for oversampling. The performance of each model was evaluated using the area under the receiver operating characteristic curve (AUC), sensitivity, specificity, and accuracy. Patients with KD admitted to Guangzhou Women and Children’s Medical Centre, Guangzhou Medical University, between January 2023 and December 2024 were enrolled as an external validation cohort.

Results: The CatBoost machine learning algorithm achieved the best comprehensive results (AUC: 0·960; sensitivity: 0·883; specificity: 0·889, and accuracy: 0·887). The internal validation results with CatBoost were AUC: 0·862; 95% confidence interval [CI]: 0·6453–0·7651; sensitivity: 0·716; specificity: 0·877; and accuracy: 0·861. The external validation results were AUC: 0·834; 95% CI: 0·783–0·884; sensitivity: 0·817; specificity: 0·838, and accuracy: 0·835.

Conclusions: We present a machine learning model that can predict the risk of IVIG non-responsiveness in patients with KD in China. This model may help doctors develop personalized treatment strategies, thus improving the prognosis of KD.

1 Background

Kawasaki disease (KD) is an acute vasculitis that primarily affects children under five years old (1). KD primarily manifests as non-exudative bilateral conjunctival hyperemia, oral mucosal changes, non-purulent cervical lymphadenopathy, polymorphic rash, and acute-phase erythema or edema of the hands and feet, or subacute-phase periungual desquamation. Coronary artery lesions (CALs) are the most frequent complication of KD and the leading cause of acquired heart disease in children (2, 3). High-dose intravenous immunoglobulin (IVIG; 2 g/kg) significantly reduces the risk of KD and CALs (4, 5). However, 10–20% of patients do not respond to IVIG, substantially increasing the risk of developing CALs (6, 7). Moreover, more than one IVIG treatment places economic strain on the family. Thus, a tool to accurately predict IVIG non-responders could help develop early intervention strategies, implement more effective alternative treatments for high-risk patients, improve prognosis, and reduce economic pressures. Some scoring models for predicting IVIG resistance exist (810), but their practical application is limited, particularly in Northeast China. To date, prediction models for Chinese patients have been developed using regional populations, not national populations, and have not been externally validated.

Machine learning algorithms are advantageous in the medical field given their efficient data processing capabilities, excellent pattern recognition and prediction functions, and ability to handle complex non-linear relationships (11). More specifically, the pattern recognition and prediction performance of machine learning algorithms are remarkable, as they can identify hidden patterns in complex datasets and predict the likelihood of disease development and treatment effects, which has valuable implications for early disease prevention and the development of personalized treatment plans. Medical data are often multidimensional and involve complex non-linear relationships, and machine learning algorithms are particularly well-suited for handling this type of data because they can uncover complex interactions and effects among variables. However, this is difficult to achieve using traditional statistical methods.

Few machine learning applications have been developed for KD, and none are available on a national scale (12). Therefore, this study used advanced machine learning models to predict cases of IVIG resistance in KD.

2 Methods

2.1 Study population

Patients diagnosed with KD or incomplete KD from January 2012 to December 2024 in the Pediatric Department of Shengjing Hospital of China Medical University were retrospectively identified using the hospital’s electronic medical record system to generate the training and internal validation cohorts. Additionally, patients with KD admitted to Guangzhou Women and Children’s Medical Centre between January 2023 and December 2024 were enrolled as an external validation cohort. The requirement for informed consent was waived because this was a retrospective study.

Patients with KD were identified according to the 2017 American Heart Association guidelines (4). A classic KD diagnosis requires a fever lasting at least five days and at least four of the following clinical signs: non-exudative bilateral bulbar conjunctival injection; oral mucosal changes (such as red lips, strawberry tongue, or diffuse erythema of the oral and pharyngeal mucosa); non-suppurative cervical lymphadenopathy; polymorphous rash; and erythema or edema of the hands and feet during the acute phase or periungual desquamation in the subacute phase. The diagnostic criteria for incomplete KD: fever lasting five or more days is accompanied by only two or three principal clinical criteria. In such cases, the diagnosis is confirmed by supplemental laboratory criteria or the presence of coronary artery abnormalities detected by echocardiography.

To ensure KD was accurately diagnosed—especially in cases with clinical overlap—we implemented a three-step confirmation process: All patients, irrespective of infection status, were required to fulfill the 2017 AHA KD diagnostic criteria. Each case underwent systematic echocardiography to assess coronary artery status. Final diagnosis was confirmed by a pediatric rheumatologist or infectious disease specialist, who actively excluded alternative diagnoses based on overall clinical presentation.

Patients receiving irregular treatment, those receiving IVIG therapy at other institutions before admission, and those not receiving IVIG during hospitalization were excluded. Patients with severe coexisting systemic diseases (e.g. allergic vasculitis, systemic juvenile idiopathic arthritis, or macrophage activation syndrome), severe hematopoietic or immune system impairment, or comorbid malignant neoplasms were also excluded.

2.2 Definitions

IVIG-resistant KD (IVIGRKD) was defined as the recrudescence of fever (axillary temperature ≥37.3 °C) at any time ≥36 hours after the completion of the first IVIG infusion, accompanied by at least one of the five primary clinical manifestations of KD.

2.3 Data collection

The following data were collected from our hospital’s electronic medical record database:

1. General information: age (in months), sex, and admission date.

2. Clinical information: primary clinical manifestations (rash, Bacillus Calmette-Guérin [i.e. BCG] inoculation site erythema, conjunctival congestion, orofacial changes, enlarged cervical lymph nodes, erythema and edema of the hands and feet during the acute phase, or periungual desquamation in the subacute phase), fever duration, fever duration before the first IVIG dose and the presence of IVIGRKD.

3. Laboratory findings before the initial IVIG treatment: routine blood count (neutrophil percentage, white blood cell count, platelet count, hemoglobin level), erythrocyte sedimentation rate, and C-reactive protein, N-terminal brain natriuretic peptide (NT-proBNP) precursor, serum albumin (ALB), alanine aminotransferase, serum sodium ion, and serum total bilirubin (TBIL) levels. In cases where multiple measurements were available before the initial treatment, the maximum value was selected for neutrophil percentage, platelet count, white blood cell count, erythrocyte sedimentation rate, and C-reactive protein, NT-proBNP precursor, alanine aminotransferase, and serum TBIL levels. The minimum value was selected for serum sodium ion, serum ALB, and hemoglobin levels.

4. Infectious indices: presence of infection, mixed infections, mycoplasma, streptococcus, Epstein-Barr virus, herpes simplex, rotavirus, norovirus, adenovirus, parainfluenza virus, chlamydia, and coxsackievirus. Method of Infection Documentation: Pathogen detection was performed using a standardized protocol: PCR was used for Mycoplasma, Chlamydia, Epstein–Barr virus (EBV), cytomegalovirus (CMV), coxsackievirus (pharyngeal swab), and norovirus (stool). Rapid antigen testing was applied for group A streptococcus (throat) and rotavirus (stool); streptococcal cases also required supportive clinical features or confirmatory culture. All testing was conducted on samples collected within 1 week before or after hospital admission. Temporal Link to KD Onset: We defined a “concurrent infection” as a pathogen detected within the 7-day window surrounding admission (i.e., from 7 days before to 7 days after). This interval was chosen to capture infections closely associated with the acute febrile phase of KD, minimizing the inclusion of incidental or remote positives.

Clinical data for the external validation cohort included winter onset, chlamydial infections, conjunctival congestion, fever duration, IVIGR resistance status, and hemoglobin, NT-proBNP, serum ALB, and TBIL levels.

2.4 Statistical analyses

Statistical analyses were performed using R version 4.4.0 (R Core Team, Vienna, Austria). Categorical variables were analyzed using the χ² test; data are presented as frequencies (%). Non-normally distributed continuous variables were analyzed using the Mann–Whitney U test; data are summarized as medians and interquartile ranges. Statistical significance was set at p <0.05. Least Absolute Shrinkage and Selection Operator (LASSO) and multivariate logistic regression analyses were employed to identify the independent risk factors for IVIGRKD. The dataset was divided into training (70%) and testing (30%) sets. The ROSE method was applied for oversampling to address the class imbalance. All machine learning models were trained on the training set, including logistic regression, support vector machine (SVM), extreme gradient boosting (XGBoost), random forest, artificial neural network, gradient boosting machine (GBM), AdaBoost, LightGBM, CatBoost, and SVM with polynomial kernels. Receiver operating characteristic (ROC) curves were generated to assess the predictive performance of each model based on the area under the curve (AUC), sensitivity, specificity, accuracy, and 95% confidence interval (CI).

For the external validation cohort, severe missing values were deleted, and samples with fewer missing values were imputed (mean imputation was used for continuous variables and mode imputation was used for categorical variables).

2.5 Shapley Additive Explanations algorithm

The SHAP algorithm is a game theory-based graphical tool for interpreting machine learning model outputs (13). This approach is applied to overcome the ‘black box’ characteristics of machine learning, resulting in consistent model interpretability. Additionally, this method provides a clear framework for assessing the significance of individual features by quantifying their contributions to the model’s predictions. It identifies the influence of each feature on the results and offers a global interpretation that helps explain the model’s behavior as a whole. For instance, an interpretable machine learning approach for predicting long-term cardiovascular disease risk in teenagers was generated using the SHAP algorithm (14), providing a novel way to inform public health policies and clinical decision-making. Additionally, one study (15) provided a comprehensive evaluation of interpretable machine learning methods applied to electrocardiogram-based heart disease classification, which led to the development of a new explanatory framework for the early detection of heart disease. We used the SHAP algorithm to analyze the global and individual sample variables of the best-fit model to quantitatively visualize the relationship between risk factors and outcomes, thus enhancing the model’s credibility.

3 Results

3.1 Basic information

We retrospectively identified 5,218 patients with KD, of whom 4,704 met the inclusion and exclusion criteria and were enrolled in the study (2,901 boys and 1803 girls; male-to-female ratio: 1·63:1). The median age at disease onset was 25 months (interquartile range, 14–40 months). Patients whose body temperature normalized within 36 h after the first infusion were considered responsive to the IVIG treatment; 4,231 patients (89·9%) were responsive (i.e. IVIG-sensitive), and 473 (10·1%) were not (i.e. IVIGRKD) (Table 1).

Table 1
www.frontiersin.org

Table 1. Baseline table of clinical indicators for IVIG non-responsive and IVIG sensitive groups.

3.2 Feature selection

LASSO regression employs L1 regularization (λ penalty term) to shrink the coefficients of less influential variables to zero. This process selects the features with the strongest predictive power for the target variable, simplifying the model and enhancing its generalization capability. We chose the core variables corresponding to lambda.lse, that is, the penalty parameter that minimizes the model’s mean square error (or binomial deviation) during cross-validation; the variables screened by this parameter were used for subsequent model construction. LASSO analysis produced coefficient profiles for 36 features (Figure 1A), demonstrating the dynamic process of the LASSO analysis for screening variables. These 38 variables from the training set were included in the LASSO regression analysis, utilizing ten-fold cross-validation for model selection (Figure 1B). A λ value (contraction parameter) of 0·009174 was selected based on the cross-validation results, yielding 25 potential predictors, including age, sex, season, the main clinical manifestations of KD (e.g. non-suppurative conjunctivitis, cervical lymphadenopathy, polymorphous rash, acute-phase induration of hands and feet or subacute-phase periungual desquamation), fever duration, timing of the initial IVIG administration, the IVIG inactivation process, the incidence of IVIGRKD, laboratory parameters before the initial IVIG dose (e.g. hematologic indicators [hemoglobin level, platelet counts], erythrocyte sedimentation rate, and NT-proBNP, serum ALB, alanine aminotransferase, and serum TBIL levels), and the presence of concurrent infections, including streptococcus, rotavirus, parainfluenza, chlamydia, and coxsackievirus. Each of these variables has been linked to IVIG resistance.

Figure 1
Panel A shows a Lasso regression plot with coefficient paths for various variables against Log(lambda), each represented by different colors. Panel B displays a plot of binomial deviance against Log(lambda), with red dots indicating values and vertical dashed lines marking selected points.

Figure 1. LASSO regression analysis for feature selection in Kawasaki disease. (A) shows the LASSO coefficient paths for 36 features; (B) the selected λlse that minimizes the mean squared error, identifying 10 main predictive features for the model.

3.3 Univariate and multivariate logistic analyses of KD presenting with prophylactic unresponsiveness

To further screen relevant variables and exclude redundant terms, we performed univariate and multivariate logistic regression analyses based on the variables identified by the LASSO regression analysis. Table 2 presents the results of the univariate logistic regression analysis, which identified significant associations between several factors and IVIG resistance, including winter onset, chlamydia, fever duration, lips and oral, non-suppurative conjunctivitis, fever duration before the first IVIG dose, and hemoglobin, NT-proBNP, serum ALB, and TBIL levels (all p <0·05). Multivariate analysis confirmed winter, chlamydia infection, conjunctival congestion, fever duration, and hemoglobin, NT-proBNP, ALB, and TBIL levels as independent risk factors, suggesting that these variables may have an independent impact on the outcome variables.

Table 2
www.frontiersin.org

Table 2. The result of univariate and multivariate logistic regression and analysis for training set.

We incorporated the significantly associated variables (p <0·05) from the multivariate logistic analysis into the machine learning models to enhance the prediction and interpretation of the study’s outcome variables.

3.4 Model performance

The CatBoost model was the best-fit model based on performance evaluations across the training and validation sets (Table 3, Figure 2). We trained ten models (logistic regression, SVM, GBM, neural network, random forest, XGBoost, AdaBoost, LightGBM, CatBoost, and polynomial kernel SVM) through 10-fold cross-validation and balanced the training set data using the up-sampling technique. CatBoost achieved the highest AUC (0·960) in the training set, with a highest sensitivity of 0·883 and a highest specificity of 0·889. Furthermore, the overall accuracy of CatBoost was 0·887, indicating strong generalizability. The AUC for CatBoost in the validation set was 0·862 with a 95% CI of 0·827–0·898, indicating its stability with unseen data; the specificity (0·877), sensitivity (0·716), and accuracy (0·861) were also excellent. This balanced and consistent performance made CatBoost the preferred model.

Table 3
www.frontiersin.org

Table 3. Performance comparison of ten machine learning (ML) models.

Figure 2
Bar chart displaying global feature importance using SHAP values. The features ranked by importance are Days of fever, ALB, TBIL, BNP, HB, Winter, Con, and CP. Days of fever has the highest mean SHAP value, while CP has the lowest.

Figure 2. ROC curves for machine learning models predicting IVIG non-response in Kawasaki Disease.

3.5 Interpretation of the CatBoost model based on the SHAP analysis

3.5.1 Global sample feature interpretation

Figure 3 presents the SHAP feature importance ranking for the CatBoost model, illustrating the influence of various features on IVIG non-response predictions in patients with KD. The most influential predictors were fever duration and serum TBIL and ALB levels. Additionally, the SHAP contribution plot (Figure 4) revealed how these features affected individual risk predictions; the red dots represent a higher risk, whereas the blue dots represent a lower risk. Figure 4 displayed the value for each variable at which the predicted probability of IVIG resistance increases most steeply (i.e., the point where the first derivative of the SHAP value curve is maximized). These “maximum-slope” decision points are: Fever Duration: 7.42 days; Albumin: 25.40 g/L; Hemoglobin: 113.39 g/L; Total Bilirubin: 5.65 μmol/L; BNP: 421.90 pg/mL; These values represent quantitative anchors where the risk escalates most rapidly, offering clinical guidance without imposing rigid, binary cut-offs.

Figure 3
ROC curves for multiple models including Logistic, SVM, GBM, Neural Network, Random Forest, XGBoost, AdaBoost, LightGBM, CatBoost, and SVMPoly. The plot has sensitivity on the y-axis and one minus specificity on the x-axis, with most curves showing high accuracy near the top left.

Figure 3. SHAP feature importance analysis of the CatBoost model.

Figure 4
SHAP scatter plot shows feature contributions to a model's output. Features like CP, Con, Winter, and others are plotted against SHAP values, ranging from -2.5 to 5.0, with colors representing their values.

Figure 4. SHAP summary plot illustrating feature effects on IVIG non-response prediction.

3.5.2 Global sample feature interpretation

Figure 5 presents a SHAP waterfall plot. This plot illustrates, for any individual patient, how each feature contributes to shifting the log-odds of IVIG resistance and converts the model output f(x)into a specific probability.

Figure 5
SHAP waterfall plot showing prediction probability of 0.809. Factors like albumin, hemoglobin, total bilirubin, days of fever, season, and BNP contribute differently with values ranging from negative to positive impacts, leading to a final prediction value of 1.38.

Figure 5. SHAP waterfall plot.

3.5.3 External validation

The external validation cohort included 568 enrolled patients with KD at Guangzhou Women and Children’s Medical Center, Guangzhou Medical University. These data were used to externally validate the risk prediction model. The sensitivity, specificity, accuracy, and AUC values were 0·817, 0·838, 0·835, and 0·834 (95% CI: 0·783–0·884), respectively (Figure 6). These results reflect the model’s long-term continuity and spatial portability and confirm its generalizability and robustness.

Figure 6
ROC curve titled “External Validation Set ROC Curve for CatBoost Model” showing sensitivity versus specificity. The curve, in green, indicates high model performance, significantly above the diagonal line, suggesting effective classification. The legend identifies the curve as representing the external validation set.

Figure 6. External validation.

4 Discussion

Machine-learning algorithms are widely utilized in the diagnosis and prognosis of cardiovascular and cerebrovascular disorders, diabetes, and other conditions (1618). This study used machine learning algorithms to develop a clinical prediction model for IVIG unresponsiveness in patients with KD for the first time. The CatBoost model, based on a large dataset (n = 4,704) from patients in Northeast China, had the best performance of ten machine learning algorithms. Internal validation of the model using 10-fold cross-validation confirmed its excellent performance and stability, and the external validation results reflect the model’s generalizability and robustness, making the prediction model applicable to the whole country.

This study provides solid support for assessing the risk factors for IVIG unresponsiveness in patients with KD in China. Our evaluation of ten machine learning models resulted in a reliable prediction tool for clinical use that, unlike prior KD prediction models (810), was interpretable and could visualize each variable’s impact on the prediction findings. To build the model, key characteristics were screened using LASSO regression to eliminate confounding factors, followed by univariable and multivariable logistic regression analyses. To further understand the fundamental aspects of IVIG resistance, we used the SHAP algorithm to analyze the CatBoost model, which identified eight features with a considerable impact on IVIG resistance: winter onset, chlamydial infections, conjunctival congestion, fever duration, and hemoglobin, NT-proBNP, serum ALB, and TBIL levels. The top three variables influencing the model’s predictions were serum TBIL and ALB levels and fever duration.

CatBoost is a gradient-boosting algorithm for categorical data and high-dimensional features (19). Its intrinsic properties make it one of the best choices for real-world applications dealing with categorical variables efficiently while avoiding time-consuming data preprocessing (20). CatBoost significantly suppresses overfitting and enhances the model’s generalization capabilities by employing a novel stochastic alignment technique, particularly when data are restricted. Compared with other gradient boosting algorithms (for example, XGBoost), CatBoost is more efficient and less demanding for hyperparameter adjustment, trains more quickly, and creates superior models (21). The AUCs for the training and validation sets for our CatBoost model were 0·960 and 0·862, respectively, indicating minimal overfitting and good generalizability. The accuracy, sensitivity, and specificity of the CatBoost model were significantly higher than those of other models, making it more suited for the balanced detection of IVIG-sensitive and IVIG-unresponsive events. Finally, we externally verified our CatBoost risk prediction model using data from 568 prospectively enrolled patients with KD at Guangzhou Women and Children’s Medical Center. The sensitivity, specificity, accuracy, and AUC values were 0·817, 0·838, 0·835, and 0·834 (95% CI: 0·783–0·884), respectively, confirming the model’s generalizability and robustness.

Interestingly, this study discovered for the first time that winter and chlamydial infections significantly enhanced the probability of IVIG resistance, validating the idea that environmental variables affect the IVIG response (22). Winter climate and environmental changes may significantly influence immunological functions, contributing to the development of IVIG resistance in patients with KD. Although winter has previously been identified as the peak season for KD (23), our results imply that an elevated risk of winter onset may be linked to the incidence of winter respiratory infections. Respiratory infection pathogens (e.g. influenza virus and adenovirus) in winter cause an increased inflammatory response, increasing the likelihood of IVIG resistance (24). Chlamydial infections are also more frequent in winter, and the systemic inflammatory response they cause affects the course of KD by stimulating the host’s immune system (2527). Chlamydial infection induces the release of pro-inflammatory cytokines, such as interleukin (IL)-6 and tumour necrosis factor-alpha (TNF-α), and activates the innate immune response, which subsequently amplifies inflammation, particularly in patients with IVIG-unresponsive KD (28). Chlamydial infection induces a Th1-type immune response, decreasing the anti-inflammatory effects of IVIG (29). Furthermore, immunological changes, such as an active complement system and enhanced macrophage responses during winter, may impair the effectiveness of IVIG (26, 27). Such immune system modifications aggravate the pathogenic potential of KD and reduce the treatment efficacy. Infection-driven hyperinflammatory responses activate multiple immune signaling pathways, including TNF-α and IL-1, which are pivotal in the pathogenesis of IVIG-resistant KD. Infection-induced cytokine release stimulates the nuclear factor kappa-light-chain-enhancer of activated B-cells (NF-κB) pathway, which induces a pro-inflammatory response and causes vascular endothelial damage and coronary artery disease. The NF-κB signaling pathway is related to winter-onset viral infections (30). Therefore, identifying winter onset and chlamydial infection as risk factors for IVIG non-responsiveness provides crucial guidelines for treating children with KD.

We also identified elevated TBIL levels as a risk factor for IVIG resistance in patients with KD. High bilirubin levels may indicate a more severe hepatic inflammatory burden, suggesting disease aggravation (31). Sunaga et al. (32) found that a low TBIL level before IVIG therapy indicated a poor immunological response, which might be related to IVIG resistance. Low bilirubin levels indicate insufficient inflammatory pathway activation, which may result in poor IVIG effectiveness. Children in the Tohoku area may be more prone to cholestasis and hyperbilirubinemia. However, Japanese individuals with Sunaga Y disease exhibit different pathological characteristics. Bilirubin has immunomodulatory properties and acts as an anti-inflammatory agent by blocking T-cell activation and decreasing oxidative stress (33). Low bilirubin levels may be ineffective in reducing excessive inflammation, resulting in IVIG resistance (34). In contrast, elevated bilirubin levels may indicate serious hepatic damage, impairing bile excretion due to increased inflammation and increasing the risk of IVIG therapy failure (35). This biphasic effect may explain the conflicting presentation of TBIL levels and IVIG resistance in several studies.

This study has one key limitation. The development phase data and the external validation data in this study were all collected retrospectively, potentially limiting the ability to control for confounding factors. Future studies should address this issue by including more diverse datasets and using prospective study designs. And this study applied the 2017 AHA diagnostic criteria for consistency. Future prospective validation will include patients diagnosed under the latest standards to further confirm the model’s generalizability.

5 Conclusion

This study analyzed the risk factors for IVIG resistance in Chinese patients with KD and constructed a robust predictive CatBoost model with enhanced interpretation via SHAP analysis. Our predictive model performed well and, for the first time, revealed associations between winter-onset KD, Chlamydia infection, and IVIG resistance. We also externally validated the model, demonstrating its generalizability and robustness. This model may help physicians better manage patients with KD, By enabling the early identification of patients at high risk for IVIG resistance, this model facilitates timely treatment intensification. This could involve the initial co-administration of glucocorticoids with IVIG or the prompt initiation of biologic agents upon early signs of treatment failure, rather than awaiting recrudescent fever to administer a second IVIG course. Such a proactive strategy aims to shorten the total duration of inflammation, which is a key driver of coronary artery lesions, thereby potentially reducing the incidence of this serious complication, avoiding the waste of medical resources, and alleviating the economic burden on families.

Data availability statement

The datasets generated and analyzed during the current study are not publicly available due to privacy or ethical restrictions. However, de-identified data are available from the corresponding author upon reasonable request. Requests to access the datasets should be directed to CW, Y3JlYXNlLnN5QDE2My5jb20=.

Ethics statement

This study was approved by the Medical Ethics Committee of Shengjing Hospital of China Medical University Institutional Review Board with a waiver for informed consent (Ethics Approval Number: KYCS2024771). The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation in this study was provided by the participants’ legal guardians/next of kin.

Author contributions

XL: Data curation, Formal Analysis, Writing – original draft, Writing – review & editing, Investigation, Validation. ZZ: Data curation, Formal Analysis, Software, Validation, Writing – original draft. JF: Data curation, Writing – original draft. LZ: Data curation, Writing – original draft. RX: Data curation, Writing – original draft. DL: Data curation, Writing – original draft. XM: Data curation, Writing – original draft. LS: Data curation, Writing – original draft. YW: Data curation, Writing – original draft. ZW: Data curation, Writing – review & editing. CW: Conceptualization, Formal Analysis, Methodology, Project administration, Resources, Supervision, Writing – review & editing.

Funding

The author(s) declare that no financial support was received for the research, and/or publication of this article.

Acknowledgments

The authors thank Professor Zhihua Yin from the Clinical Epidemiology Department of China Medical University for providing guidance on the statistical analysis.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Generative AI was used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fimmu.2025.1629600/full#supplementary-material

References

1. Newburger JW, Takahashi M, and Burns JC. Kawasaki disease. J Am Coll Cardiol. (2016) 67:1738–49. doi: 10.1016/j.jacc.2015.12.073

PubMed Abstract | Crossref Full Text | Google Scholar

2. Sosa T, Brower L, and Divanovic A. Diagnosis and management of kawasaki disease. JAMA Pediatr. (2019) 173:278–9. doi: 10.1001/jamapediatrics.2018.3307

PubMed Abstract | Crossref Full Text | Google Scholar

3. Mossberg M, Mohammad AJ, Kahn F, Segelmark M, and Kahn R. High risk of coronary artery aneurysm in Kawasaki disease. Rheumatol (Oxford). (2021) 60:1910–4. doi: 10.1093/rheumatology/keaa512

PubMed Abstract | Crossref Full Text | Google Scholar

4. McCrindle BW, Rowley AH, Newburger JW, Burns JC, Bolger AF, Gewitz M, et al. diagnosis, treatment, and long-term management of kawasaki disease: A scientific statement for health professionals from the american heart association. circulation. (2017) 135:e927–99. doi: 10.1161/CIR.0000000000000484

PubMed Abstract | Crossref Full Text | Google Scholar

5. Li Y, Xu Z, Wu L, Liang X, Zhao L, Liu F, et al. Tenascin-C predicts IVIG non-responsiveness and coronary artery lesions in Kawasaki disease in a Chinese cohort. Front Pediatr. (2022) 10:979026. doi: 10.3389/fped.2022.979026

PubMed Abstract | Crossref Full Text | Google Scholar

6. J.J.W, Group. Guidelines for diagnosis and management of cardiovascular sequelae in Kawasaki disease (JCS 2013), Digest version. Circ J. (2014) 78:2521–62. doi: 10.1253/circj.CJ-66-0096

PubMed Abstract | Crossref Full Text | Google Scholar

7. Ogata S, Bando Y, Kimura S, Ando H, Nakahata Y, Ogihara Y, et al. et al, The strategy of immune globulin resistant Kawasaki disease: a comparative study of additional immune globulin and steroid pulse therapy. J Cardiol. (2009) 53:15–9. doi: 10.1016/j.jjcc.2008.08.002

PubMed Abstract | Crossref Full Text | Google Scholar

8. Kobayashi T, Inoue Y, Takeuchi K, Okada Y, Tamura K, Tomomasa T, et al. Prediction of intravenous immunoglobulin unresponsiveness in patients with Kawasaki disease. Circulation. (2006) 113:2606–12. doi: 10.1161/CIRCULATIONAHA.105.592865

PubMed Abstract | Crossref Full Text | Google Scholar

9. Sano T, Kurotobi S, Matsuzaki K, Yamamoto T, Maki I, Miki K, et al. Prediction of non-responsiveness to standard high-dose gamma-globulin therapy in patients with acute Kawasaki disease before starting initial treatment. Eur J Pediatr. (2007) 166:131–7. doi: 10.1007/s00431-006-0223-z

PubMed Abstract | Crossref Full Text | Google Scholar

10. Li C, Wu S, Shi Y, Liao Y, Sun Y, Yan H, et al. establishment and validation of a multivariate predictive scoring model for intravenous immunoglobulin-resistant kawasaki disease: A study of children from two centers in China. Front Cardiovasc Med. (2022) 9:883067. doi: 10.3389/fcvm.2022.883067

PubMed Abstract | Crossref Full Text | Google Scholar

11. Han Y, Xie X, Qiu J, Tang Y, Song Z, Li W, et al. Early prediction of sepsis associated encephalopathy in elderly ICU patients using machine learning models: a retrospective study based on the MIMIC-IV database. Front Cell Infect Microbiol. (2025) 15:1545979. doi: 10.3389/fcimb.2025.1545979

PubMed Abstract | Crossref Full Text | Google Scholar

12. Theunissen M and Browning J. Putting explainable AI in context: institutional explanations for medical AI. Ethics Inf Technol. (2022) 24:23. doi: 10.1007/s10676-022-09649-8

PubMed Abstract | Crossref Full Text | Google Scholar

13. Petch J, Di S, and Nelson W. Opening the black box: the promise and limitations of explainable machine learning in cardiology. Can J Cardiol. (2022) 38:204–13. doi: 10.1016/j.cjca.2021.09.004

PubMed Abstract | Crossref Full Text | Google Scholar

14. Salah H and Srinivas S. Explainable machine learning framework for predicting long-term cardiovascular disease risk among adolescents. Sci Rep. (2022) 12:21905. doi: 10.1038/s41598-022-25933-5

PubMed Abstract | Crossref Full Text | Google Scholar

15. Ayano YM, Schwenker F, Dufera BD, and Debelee TG. Interpreta ble machine learning techniques in ECG-based heart disease classification: A systematic review. Diagnostics (Basel). (2022) 13:111. doi: 10.3390/diagnostics13010111

PubMed Abstract | Crossref Full Text | Google Scholar

16. Sun Z, Dong W, Shi H, Ma H, Cheng L, and Huang Z. Comparing machine learning models and statistical models for predicting heart failure events: A systematic review and meta-analysis. Front Cardiovasc Med. (2022) 9:812276. doi: 10.3389/fcvm.2022.812276

PubMed Abstract | Crossref Full Text | Google Scholar

17. Bock C, Walter JE, Rieck B, Strebel I, Rumora K, Schaefer I, et al. Enhancing the diagnosis of functionally relevant coronary artery disease with machine learning. Nat Commun. (2024) 15:5034. doi: 10.1038/s41467-024-49390-y

PubMed Abstract | Crossref Full Text | Google Scholar

18. Lee WV, Song Y, Chun JS, Ko M, Jang HY, Kim IW, et al. Development of a machine learning model for precision prognosis of rapid kidney function decline in people with diabetes and chronic kidney disease. Diabetes Res Clin Pract. (2024) 217:111897. doi: 10.1016/j.diabres.2024.111897

PubMed Abstract | Crossref Full Text | Google Scholar

19. Ajin RS, Segoni S, and Fanti R. Optimization of SVR and CatBoost models using metaheuristic algorithms to assess landslide susceptibility. Sci Rep. (2024) 14:24851. doi: 10.1038/s41598-024-72663-x

PubMed Abstract | Crossref Full Text | Google Scholar

20. Zhang M, Shen T, Li Y, Li Q, and Lou Y. Exploring the complex associations between community public spaces and healthy aging: an explainable analysis using catboost and SHAP. BMC Public Health. (2025) 25:2200. doi: 10.1186/s12889-025-23402-y

PubMed Abstract | Crossref Full Text | Google Scholar

21. Sumathi M, Sahana S, Rajeswari SSR, Kruthi V, and Raja SP. Enhancing diabetes prediction with minimal processing time using catboost: a comparative study. J Eval Clin Pract. (2025) 31:e70284. doi: 10.1111/jep.70284

PubMed Abstract | Crossref Full Text | Google Scholar

22. Yılmazer MM, Özdemir R, Meşe T, Küçük M, Öner T, Devrim İ, et al. Kawasaki disease in Turkish children: a single center experience with emphasis on intravenous immunoglobulin resistance and giant coronary aneurysms. Turk J Pediatr. (2019) 61:648–56. doi: 10.24953/turkjped.2019.05.002

PubMed Abstract | Crossref Full Text | Google Scholar

23. Makino N, Nakamura Y, Yashiro M, Ae R, Tsuboi S, Aoyama Y, et al. Descriptive epidemiology of Kawasaki disease in Japan, 2011-2012: from the results of the 22nd nationwide survey. J Epidemiol. (2015) 25:239–45. doi: 10.2188/jea.JE20140089

PubMed Abstract | Crossref Full Text | Google Scholar

24. Dionne A, Le CK, Poupart S, Autmizguine J, Meloche-Dumas L, Turgeon J, et al. Profile of resistance to IVIG treatment in patients with Kawasaki disease and concomitant infection. PloS One. (2018) 13:e0206001. doi: 10.1371/journal.pone.0206001

PubMed Abstract | Crossref Full Text | Google Scholar

25. Narita M. Classification of extrapulmonary manifestations due to mycoplasma pneumoniae infection on the basis of possible pathogenesis. Front Microbiol. (2016) 7:23. doi: 10.3389/fmicb.2016.00023

PubMed Abstract | Crossref Full Text | Google Scholar

26. Nagata S. Causes of kawasaki disease-from past to present. Front Pediatr. (2019) 7:18. doi: 10.3389/fped.2019.00018

PubMed Abstract | Crossref Full Text | Google Scholar

27. Strigl S, Kutlin A, Roblin PM, Shulman S, and Hammerschlag MR. Is there an association between Kawasaki disease and Chlamydia pneumoniae? J Infect Dis. (2000) 181:2103–5. doi: 10.1086/315526

PubMed Abstract | Crossref Full Text | Google Scholar

28. Kuo HC, Yang KD, Chang WC, Ger LP, and Hsieh KS. Kawasaki disease: an update on diagnosis and treatment. Pediatr Neonatol. (2012) 53:4–11. doi: 10.1016/j.pedneo.2011.11.003

PubMed Abstract | Crossref Full Text | Google Scholar

29. Puri BK, Lee GS, and Schwarzbach A. The Role of Chlamydia pneumoniae in the Aetiology of Autoimmune Diseases. Cureus. (2023) 15:e49095. doi: 10.7759/cureus.49095

PubMed Abstract | Crossref Full Text | Google Scholar

30. Liu T, Zhang L, Joo D, and Sun SC. NF-κB signaling in inflammation. Signal Transduct Target Ther. (2017) 2:17023–25. doi: 10.1038/sigtrans.2017.23

PubMed Abstract | Crossref Full Text | Google Scholar

31. Ostapowicz G, Fontana RJ, Schiødt FV, Larson A, Davern TJ, Han SH, et al. Results of a prospective study of acute liver failure at 17 tertiary care centers in the United States. Ann Intern Med. (2002) 137:947–54. doi: 10.7326/0003-4819-137-12-200212170-00007

PubMed Abstract | Crossref Full Text | Google Scholar

32. Sunaga Y, Watanabe A, Katsumata N, Toda T, Yoshizawa M, Kono Y, et al. A simple scoring model based on machine learning predicts intravenous immunoglobulin resistance in Kawasaki disease. Clin Rheumatol. (2023) 42:1351–61. doi: 10.1007/s10067-023-06502-1

PubMed Abstract | Crossref Full Text | Google Scholar

33. Juping D, Yuan Y, Shiyong C, Jun L, Xiuxiu Z, Haijianet Y, et al. Serum bilirubin and the risk of rheumatoid arthritis. J Clin Lab Anal. (2017) 31:e22118. doi: 10.1002/jcla.22118

PubMed Abstract | Crossref Full Text | Google Scholar

34. Yan Y, Qiao L, Hua Y, Shao S, Zhang N, Wu M, et al. Predictive value of Albumin-Bilirubin grade for intravenous immunoglobulin resistance in a large cohort of patients with Kawasaki disease: a prospective study. Pediatr Rheumatol Online J. (2021) 19:147. doi: 10.1186/s12969-021-00638-7

PubMed Abstract | Crossref Full Text | Google Scholar

35. Ollinger R, Bilban M, Erat A, Froio A, McDaid J, Tyagi S, et al. Bilirubin: a natural inhibitor of vascular smooth muscle cell proliferation. Circulation. (2005) 112:1030–9. doi: 10.1161/CIRCULATIONAHA.104.528802

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: Kawasaki disease, IVIG resistance, machine learning, SHAP algorithm, intravenous immunoglobulin

Citation: Li X, Zhou Z, Fan J, Zhao L, Xu R, Li D, Ma X, Sun L, Wu Y, Wang Z and Wang C (2025) A machine learning algorithm to predict treatment effectiveness for Kawasaki disease in China: a retrospective model development and validation study. Front. Immunol. 16:1629600. doi: 10.3389/fimmu.2025.1629600

Received: 16 May 2025; Accepted: 12 November 2025; Revised: 30 October 2025;
Published: 26 November 2025.

Edited by:

Daniela Bosisio, University of Brescia, Italy

Reviewed by:

Paul Anthony Brogan, University College London, United Kingdom
Marco Cattalini, University of Brescia, Italy

Copyright © 2025 Li, Zhou, Fan, Zhao, Xu, Li, Ma, Sun, Wu, Wang and Wang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Ce Wang, Y3JlYXNlLnN5QDE2My5jb20=; Zhouping Wang, d2FuZ196aG91cGluZ0Bnd2NtYy5vcmc=

These authors have contributed equally to this work

These authors have contributed equally to this work

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.