- 1Department of Hematology/Oncology, The Affiliated Children’s Hospital of Xiangya School of Medicine, Central South University (Hunan Children’s Hospital), Changsha, China
- 2Department of Hematology, Wuhan Children’s Hospital, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China
- 3Beijing BFR Gene Diagnostics, Beijing, China
- 4Department of Pediatric Hematology and Oncology, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, China
Introduction: Acute graft-versus-host disease (aGVHD) is a major contributor to non-relapse mortality (NRM) in pediatric patients undergoing allogeneic hematopoietic stem cell transplantation (allo-HSCT). Although the MAGIC algorithm has been validated in adults, its predictive value in children remains insufficiently explored.
Methods: We conducted a prospective multicenter cohort study including 105 Chinese pediatric allo-HSCT recipients diagnosed with aGVHD between May 2019 and August 2023. Endpoints were 6-month NRM, overall survival (OS), and Day-28 treatment response. Multivariable analyses incorporated clinical variables together with the Panel 2 score, hereafter referred to as Panel 2,using Cox regression for NRM/OS and logistic regression for treatment response.
Results: Age ≥12 years (hazard ratio 4.36, 95% CI 1.62–11.75; P=0.003) and a high Panel 2 score (HR 3.09, 95% CI 1.08–8.82; P=0.035) were independent predictors of 6-month NRM and OS. The high-risk (HR) group, defined by the combination of age ≥12 years and a high Panel 2 score, had markedly higher NRM than the low-risk (LR) group (71% vs 12.2%; HR 5.00, 95% CI 1.75–9.56; P=0.001) and significantly worse OS (P<0.001). Panel 2 was also predictive of Day-28 treatment response, with lower CR/PR rates in the high versus low group (62% vs 92%; P<0.001).
Discussion: The Panel 2 score effectively predicted NRM, OS, and treatment response in pediatric aGVHD. Incorporating age ≥12 years further enhanced risk stratification, enabling clear separation between HR and LR groups. These findings support the potential clinical utility of this combined model and warrant validation in larger, international pediatric cohorts.
Introduction
Allogeneic hematopoietic stem cell transplantation (allo-HSCT) is a potential curative approach for numerous pediatric patients with both malignant and nonmalignant conditions. Despite advances in transplant techniques and supportive care, acute graft-versus-host disease (aGVHD) remains a severe complication and a substantial cause of morbidity and non-relapse mortality (NRM). Moderate-to-severe aGVHD affects approximately 13 - 47% of pediatric allo-HSCT recipients and is associated with poor outcomes (1). Conventional prognostic approaches—based primarily on clinical grading at aGVHD onset together with standard transplant-related parameters—have limited accuracy in predicting critical outcomes such as NRM and overall survival (OS). This limitation can result in overtreatment of low-risk patients and undertreatment of high-risk patients. Consequently, there is growing interest in developing objective, biomarker-based tools that enable earlier and more accurate risk stratification (2, 3).
For more than a decade, high-throughput detection methods (e.g., proteomic mass spectrometry, cytomics assays, multiplex immunoassays, and array-based assays) have uncovered numerous biomarkers related to aGVHD, significantly advancing the understanding of the complex pathophysiology of aGVHD (4, 5). These biomarkers have been investigated as individual markers (6–8), in composite panels (9), and in biomarker algorithms (10–12). Recent studies have also explored integrating biomarker profiles with clinical and transplantation-related factors using machine learning, offering an expanded approach and representing one of the emerging trends in risk prediction for allo-HSCT recipients (13).
Years of research have progressed from initially screening numerous candidate biomarkers to recently focusing on a few key proteins. Several core markers have since emerged as particularly relevant in acute GVHD. These include soluble suppression of tumorigenicity 2 (sST2), regenerating islet–derived protein 3α (REG3α), tumor necrosis factor receptor 1(TNFR1), interleukin-6 (IL-6), and T cell immunoglobulin and mucin domain–containing protein 3 (TIM3) (11, 14). Of these, sST2 and REG3α have undergone extensive internal or external validation, and algorithms combining these two markers, known as the MAGIC algorithm, have demonstrated predictive value.
Notably, the MAGIC algorithm developed by the Mount Sinai Acute GVHD International Consortium (12) to as Panel 2 in this study, has been most extensively evaluated. It stratifies patients into risk groups either on day 7 post-transplant or at aGVHD onset to predict 6-month NRM. Multiple studies have validated its prognostic utility; furthermore, a randomized multicenter trial adopted the Panel 2 as a stratification criterion for therapeutic intervention (15). However, most of these validations have been conducted in adult Western cohorts, and their applicability to Chinese pediatric populations remains uncertain. Moreover, some external validations for this algorithm have yielded mixed results. One study reported differences in clinical applicability (16). Another found limited prognostic value (17). Additionally, a pediatric cohort analysis indicated that neither sST2 nor REG3α were effective markers for aGVHD diagnosis or prognosis (18).
Building on these advancements, we conducted a prospective multicenter study enrolling Chinese pediatric recipients of allo-HSCT who developed aGVHD. Plasma levels of sST2, REG3α, sTNFR1, IL - 6, and IL - 8 were measured at aGVHD onset. This study aimed to externally validate the MAGIC Consortium’s Panel 2 (sST2 + REG3α) algorithm in a Chinese pediatric multicenter cohort, given its extensive validation in adult and mixed-age populations.
This work represents the first prospective multicenter validation of Panel 2 in this population and explores its potential enhancement through integration of relevant clinical risk factors, with the goal of supporting more individualized post-aGVHD management in pediatric allo-HSCT recipients. In this paper, we describe the biomarker measurements and the determination of the Panel 2 threshold; validate the prognostic utility of Panel 2 by assessing its capacity to stratify patients by 6-month NRM risk and overall survival (OS), as well as to predict treatment response; and evaluate whether incorporating relevant clinical risk factors into the biomarker model could improve prognostic performance. We also discuss the clinical implications, biological rationale, limitations, and directions for future research.
Methods
Patient/study population
This multicenter, prospective, observational cohort study consecutively enrolled pediatric patients (aged <18 years) with de novo aGVHD following allo-HSCT. The study was conducted at three Chinese tertiary centers: Hunan Children’s Hospital (n=26), Wuhan Children’s Hospital (n=42), and the First Affiliated Hospital of Zhengzhou University (n=36). The enrollment period extended from May 2019 to August 2023. Patients were monitored for aGVHD development for 100 days post-transplantation, with follow-up continuing through March 2024 to assess endpoints: the 6-month cumulative incidence of NRM, OS, and aGVHD treatment response. The study complied with the Declaration of Helsinki and was approved by the Ethics Committee of Hunan Children’s Hospital (Approval No. KY2021 - 55); written informed consent was obtained from all guardians.
Endpoint and definition
MAGIC criteria were used to diagnose and grade aGVHD (19), and management followed each center’s institutional protocols. Briefly, first-line therapy consisted of glucocorticoids, with second-line agents introduced for suboptimal steroid responses. The primary endpoint was 6-month NRM, NRM defined as death attributable to causes other than underlying disease relapse/progression. Secondary endpoints included treatment response by Day 28 and OS. Complete response (CR) was defined as the complete resolution of aGVHD manifestations in all involved target organs. Partial response (PR) was defined as improvement in symptoms without full resolution in at least one target organ and no worsening in any other. Progressed Disease (PD) was defined as the worsening of acute GVHD in at least one target organ (by at least one stage), with or without concurrent improvement in other organs. Patients were classified as non-responders (NR) if their aGVHD symptoms failed to improve or worsened after systemic corticosteroids, if they required additional systemic immunosuppression for aGVHD, or if they died within the first 4 weeks of treatment. OS was defined as the time from allo-HSCT to death from any cause (censored at last follow-up March 2024).
Biomarker measurement and panel scoring
Blood samples from each research center were collected in EDTA anticoagulant tubes, transported on ice (4 °C) to a single central laboratory (Guang Zhou BofuRui Biolaboratory) within 48 hours, and centrifuged at 1,500 × g for 10 min at 4 °C to separate serum, which was assayed within 2 hours for sST2, REG3α, sTNFR1, IL - 6, IL - 8 using a customized cytokine detection kit (catalog no. LXSAHM-05) on the Luminex® 200™ system employing xMAP technology according to the manufacturer’s instructions; Predicted 6-month NRM probabilities were obtained using the previously published MAGIC algorithm for Panel 2, as follows: log[−log(1−p)] = −11.263 + 1.844 × log10(ST2) + 0.577 × log10(REG3α), where p denotes the predicted probability of 6-month NRM.
Statistical analysis
Box plots were used to display the distributions of log-transformed plasma ST2 and REG3α concentrations, as well as their derived Panel 2 scores. Because our assay platforms differed from those used in the original studies, the distribution of Panel 2 scores was also different.
Similar to best practices in predictive modeling (20, 21), we sought to minimize overfitting risk by defining the Panel 2 threshold using the 75th percentile (Q3) of our cohort rather than optimizing cutoffs to maximize performance in the training data. This approach avoids tailoring the model to idiosyncrasies of our sample distribution and ensures broader applicability. Additionally, we examined whether biomarker–outcome associations followed clinically plausible trends, analogous to external parametric trend validation in prior modeling studies, thereby providing further assurance that the selected threshold was not overfit to our dataset.
Continuous variables are presented as median (interquartile range) and compared between the Panel 2 high and low groups by the Mann–Whitney U test. Categorical variables are expressed as number (percentage) and compared by Pearson’s chi-square test or Fisher’s exact test, as appropriate.
We selected commonly recognized clinical prognostic factors and the MAGIC Panel 2 risk group for survival and treatment response analysis. Univariate Cox proportional-hazards models were used to estimate hazard ratios (HRs) and 95% confidence intervals for each variable, and those with P < 0.10 were considered candidates for multivariable modeling. If more than a manageable number of clinical factors met this threshold, a stepwise selection approach was applied to limit the covariate set and avoid overfitting. To preserve model stability and remain consistent with our focus on external validation of the Panel 2 algorithm, no additional exploratory biomarkers were included; only the Panel 2 score together with at most one or two key clinical factors were retained in the multivariable analyses.
Since only three patients experienced relapse, parameter estimates in a competing-risks model would be unstable. Therefore, we treated relapse events as censored and estimated NRM and OS using the Kaplan–Meier method, with differences between groups assessed by the log-rank test and hazard ratios derived from a Cox proportional-hazards model. Day 28 treatment-response rates across the risk groups were visualized with stacked bar charts and compared using the chi-square test or Fisher’s exact test, as appropriate. To evaluate predictive performance, decision-curve analysis (DCA) was conducted for the clinical model, the Panel 2 model, and the combined model to assess net clinical benefit, and receiver operating characteristic (ROC) curves were generated for the same models to compare their discriminative ability.
All analyses were conducted in R version 4.4.1 (R Foundation for Statistical Computing) using the survival and dcurves packages. Two-sided P < 0.05 was considered statistically significant (22).
Results
Patient characteristics and transplant overview
This cohort included 105 consecutively enrolled pediatric recipients of allo-HSCT; their baseline characteristics are summarized in Table 1. The median age was 7 years (1–18); 78.1% were younger than 12 years and 21.9% were 12 years or older. Underlying diagnoses included non-malignant disorders in 54.3% patients and malignant hematologic diseases in 45.7%. Seventy-six patients (72.4%) received grafts from HLA-mismatched donors, of whom 46 (60.5%) underwent related haploidentical transplantation. Peripheral blood stem cells were the primary graft source in 88.6% of cases; a subset of patients also received supplemental donor bone marrow and/or third-party umbilical cord blood. A large majority received myeloablative conditioning and calcineurin inhibitor-based GVHD prophylaxis.
Acute GVHD manifested at a median of 24 days post-transplant (IQR 15–36 days). Initial severity at diagnosis was grade I: 30 patients (28.6%), grade II: 59 (56.2%), grade III: 15 (14.3%), and grade IV: 1 (0.9%). During follow-up, maximal severity reached grade I: 20 (19%), grade II: 30 (28.6%), grade III: 26 (24.8%), and grade IV: 29 (27.6%) (Supplementary Table S1). At 28 days after steroid initiation, 89 (84.8%) achieved complete or partial response (CR/PR), 16 (15.2%) had no response or progressive disease (NR/PD). During subsequent follow-up, 3 patients (2.9%) experienced disease relapse, including 2 (1.9%) relapse-related deaths. At 6 months, 17 patients experienced non-relapse mortality (NRM), corresponding to a cumulative incidence of 16.2%. The aGVHD grade at onset, donor type, and ATG dose in the conditioning regimen differ significantly between the high- and low-Panel 2 groups.
Association of clinical factor and biomarkers panel with outcomes
Supplementary Figure 1 shows the distributions of log-transformed sST2, REG3α, IL - 6, IL - 8, sTNFR1, and the Panel 2 score. The wide interquartile ranges (IQRs) and presence of extreme values across all biomarkers indicate considerable variability and adequate coverage of clinically relevant value ranges. Figure 1 shows log-transformed ST2 and REG3α, and panel 2 scores were all significantly higher in the NRM group than in the non-NRM (all P < 0.05). Univariate analysis demonstrated that for clinical variables, patients’ aged ≥12 years and aGVHD grade III–IV at onset were significantly associated with an increased risk of 6-month NRM and poorer overall survival in Cox regression, whereas the stem cell source (PBSC with versus without UBC) was the only factor significantly linked to Day-28 treatment non-response in logistic regression; for Panel 2 score—which were dichotomized at the 75th percentile of Luminex-measured analyte values rather than the fixed clinical MAGIC assay cutoff-the high-risk group had a significantly higher hazard of 6-month NRM (HR 4.26; 95% CI 1.64–10.60; P = 0.003), higher odds of Day-28 non-response (OR 8.0; 95% CI 2.52–25.38; P < 0.001), and worse OS (HR 4.19; 95% CI 2.24–10.34; P = 0.002)(Table 2). Exploratory univariate results for the individual biomarkers (sST2, REG3α, TNFα, IL - 6, IL - 8 and sTNFR1) are also provided in Supplementary Table S2.

Figure 1. Boxplots of log10(REG3a), log10(sST2), and panel 2 scores by NRM status. * p = 0.012; ** p = 0.005; *** p = 0.0002.
To assess whether age might confound the relationship between aGVHD grade at onset and NRM, we compared age group (<12 vs ≥12 years) with GVHD grade (I–II vs III–IV) using a Pearson χ² test. There was no significant association between age group and GVHD grade (χ² = 0.98; P = 0.32), indicating that age and GVHD grade are independent in our cohort and that age does not act as a confounder of the GVHD grade–NRM relationship.
Multivariable model for stratification
Variables included in the multivariable Cox model were limited to the pre-specified Panel 2 score and key clinical factors (age and aGVHD onset grade) to help preserve model stability, precluding the addition of other biomarkers. In multivariable analysis for 6-month NRM, OS, and day 28 response.we found that GVHD onset grade was no longer significant for any outcome, whereas age remained an independent predictor of 6-month NRM and OS,but not of day 28 response. The panel 2 score remained significant for every endpoint (Table 3). Combining Panel 2 with age yielded four subgroups for further evaluation: Group1: Panel 2 high and age ≥ 12; Group 2: Panel 2 high and age < 12 y; Group 3: Panel 2 low and age ≥ 12 y; Group 4: Panel 2 low and age < 12y.
The cumulative incidence of NRM for all patients is shown in Figure 2A. To operationalize the models, we first compared 6-month NRM across the four subgroups (Figure 2B). Groups 2, 3, and 4 each had significantly lower NRM than Group 1 (all P < 0.01), whereas no significant differences were observed among Groups 2, 3, and 4 (P > 0.05). We therefore combined Groups 2–4 into a single “low-risk” (LR) category, with Group 1 designated “high-risk” (HR). The LR and HR groups had 6-month NRM rates of 12.2% and 71%, respectively; compared with the LR group, the HR group had an HR of 5.0 (95% CI 1.75 - 9.56; P = 0.001) (Figure 2C), and their OS rates of 85.7% and 40.0% also differed significantly across strata (P < 0.001) (Figure 2D). Therefore, we established a prognostic model based on Panel 2 and patient age.

Figure 2. Multivariable model–stratified NRM and OS. (A) Cumulative incidence of NRM for all patients; (B) NRM stratified by four groups; (C) NRM comparison between HR and LR groups; (D) Overall survival stratified by HR vs LR groups.
Since age did not correlate with Day-28 treatment response in univariate analysis, we did not apply our prognostic model to response assessment. Instead, we evaluated Day-28 response using only the Panel 2 classification. The Panel 2 high group had a significantly lower CR/PR rate at Day 28 (62%) compared with the Panel 2 low group (92%) (P < 0.001) (Figure 3A).

Figure 3. Treatment response and age-modified prognostic impact of Panel 2 in aGVHD (A). Day-28 response to aGVHD therapy (CR/PR vs. PR/PD) stratified by final high-risk (HR) and low-risk (LR) groups. (B) Six-month non-relapse mortality (NRM) stratified by Panel 2 risk and age. While patients with high Panel 2 scores generally showed increased NRM, those aged <12 years exhibited substantially lower risk, indicating that age further refines prognostication within the high Panel 2 subgroup.
ROC analysis showed that Panel 2 alone yielded an AUC of 0.761 for predicting 6-month NRM, while the addition of age modestly increased the AUC to 0.793 (Figure 4A). In contrast, decision-curve analysis revealed a more pronounced difference in clinical utility. Within the clinically relevant threshold range of 10–40%, the combined model (Panel 2 + age) achieved a maximum net benefit of approximately 0.12, compared with 0.05–0.06 for Panel 2 or age alone (Figure 4B).

Figure 4. Performance of Age, Panel2, and combined models. (A) ROC curves of the three models; (B) Decision curve analysis of the three models.
Discussion
Some studies aim to predict GVHD occurrence and severity at early post-transplant time points, for example, days 7 and 14 post-transplant, while others seek to forecast treatment response and long-term prognosis at GVHD onset or one week after therapy. Early prediction is primarily intended to prevent or mitigate GVHD development; however, once GVHD has occurred, the patient’s condition has already changed, and prognostic assessment after treatment may miss the optimal intervention window. Therefore, this study focuses on the onset time point, as prognostic prediction at onset may have greater clinical translational value. Additionally, several studies have suggested that monitoring dynamic changes in biomarkers is a promising approach (23, 24). However, this method requires more frequent assays and enhanced monitoring, and thus merits further exploration.
In this multicenter, prospective Chinese pediatric allo-HCT cohort, we aimed to validate panel 2, which has been extensively validated and recognized as MAGIC for prognosticating aGVHD in adults, to assess its applicability in our cohort. And we found this panel 2 demonstrated significant prognostic utility for 6-Month NRM, OS and day 28 aGVHD treatment response. These findings align with both adult and pediatric cohorts.
In multivariable analyses of 6-month NRM and OS, both age and the Panel 2 score were independent predictors. However, the aGVHD grade at onset lost statistical significance. The loss of significance for clinical aGVHD grading indicates substantial collinearity with biomarker levels. This finding highlights the superior prognostic utility of Panel 2. Besides, the strong correlation between aGVHD grade and Panel 2 also supports a competitive (or confounding) relationship between traditional clinical assessment and biomarker-based risk stratification in this cohort (Table 1).
Among patients with high Panel 2 scores, age further stratified prognosis: those aged ≥12 years had significantly higher 6-month NRM and worse OS. Integrating age into the risk model reclassified 19 of 105 patients (18.1%) originally deemed high-risk as low-risk—potentially sparing them from overtreatment (Figure 3B). Conversely, adolescents (≥12 years) with elevated scores represent a subgroup requiring intensified surveillance and earlier therapeutic intervention.
Although ROC curve analysis showed that adding age to Panel 2 yielded only a modest improvement in overall discrimination, decision-curve analysis demonstrated clear incremental clinical utility of the combined model. Within the clinically relevant threshold range, the combined model (Panel 2 + age) achieved a maximum net benefit of approximately 0.12, compared with 0.05–0.06 for Panel 2 or age alone. This corresponds to correctly identifying an additional 6–7 high-risk patients per 100 without increasing unnecessary interventions, underscoring the clinical value of incorporating age into risk stratification beyond Panel 2 alone. In exploratory stratified analyses, the prognostic impact of age appeared most pronounced among patients with high Panel 2 scores, which may explain why the incremental AUC gain was modest when averaged across the full cohort.
In our study, age emerged as an independent predictor of six-month NRM risk with a significance level exceeding initial expectations. To explore potential confounders, we compared baseline characteristics between the two age groups and found no evidence that other variables could account for this difference. This observation stands in contrast to a recent MAGIC Consortium study validating the MAGIC algorithm score as a prognostic biomarker in pediatric GVHD patients, in which age (< 12 vs. ≥ 12 years) did not influence NRM in multivariable analysis (25). Moreover, our review of the Chinese literature revealed no analogous studies of pediatric allo-HSCT cohorts for direct comparison.
The divergence between our findings and those of the MAGIC Consortium may be attributable to differences in baseline clinical characteristics and genetic background. In our cohort, 54% of patients had nonmalignant disease indications, compared to 25% in the MAGIC study. Haploidentical donors were used in 43.8% versus 14%; and ATG at > 5 mg/kg per GIAIC (26)regimen was administered to 52.4% of patients, a rate substantially higher than in the MAGIC cohorts.
To explore why patients aged 12 years or older -those in adolescence within the high-risk Panel 2 subgroup- fared worse, we first note that prior large cohort studies have consistently shown an age effect: adolescents and young adults experience higher rates of acute GVHD compared with younger children, a pattern observed in both unrelated donor (27) and matched sibling donor (28) hematopoietic stem cell transplantation. A plausible biological explanation involves puberty-associated changes in immune regulation. Mature donor T cells are both necessary and sufficient to initiate acute GVHD (29), so the development of GVHD largely depends on antigen presentation and subsequent immune cell activation. Puberty is characterized by marked fluctuations in sex hormones, particularly estrogen, which has been shown to enhance antigen-presenting cell activation via estrogen receptor α–dependent pathways in dendritic cells. This amplifies naïve CD4+ T-cell priming, pro-inflammatory cytokine production, and polarization toward Th1 and Th17 lineages—both implicated in aGVHD pathogenesis (30, 31). These mechanisms not only increase susceptibility to acute GVHD but may also intensify disease severity once GVHD occurs, thereby contributing to the higher NRM observed among adolescents with elevated Panel 2 scores. In contrast, in children under 12 years, the presence of an active thymus supports efficient central tolerance through robust negative selection of donor-derived T cells, even in an inflammatory environment (32), which blunts both the incidence and clinical impact of GVHD despite a high biomarker risk profile.
Our study has several limitations. The limited sample size constrained multivariable modeling and prevented robust evaluation of individual GVHD biomarkers, limiting comparison between single biomarkers and composite algorithms. In addition, to reduce overfitting, we used the third-quartile (Q3) value of the Panel 2 distribution as a pragmatic cutoff, which may not represent the most discriminative threshold. Larger cohorts will be needed to validate biomarker-specific contributions, confirm the prognostic role of age, and refine cutoff selection. Future studies may also explore advanced machine learning approaches, such as the Data Ensemble Refinement Greedy Algorithm (DERGA) (33–35) for biomarker-based risk stratification in aGVHD.
In summary, we validated the prognostic utility of the MAGIC Panel 2 biomarker in a Chinese pediatric cohort. By incorporating age into a multivariable model with Panel 2, we improved the accuracy of predicting 6-month NRM and overall survival following aGVHD. Panel 2 also proved to be a robust predictor of day 28 treatment response. These findings clarify the age-dependent performance of Panel 2, thereby laying the groundwork for tailored risk stratification and therapeutic strategies.
Data availability statement
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.
Ethics statement
The studies involving humans were approved by the Ethics Committee of Hunan Children’s Hospital (Approval No. KY2021- 55). The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation in this study was provided by the participants’ legal guardians/next of kin.
Author contributions
NS: Investigation, Data curation, Writing – original draft. HX: Writing – original draft, Writing – review & editing. RX: Formal Analysis, Writing – review & editing, Data curation, Methodology. WC: Writing – review & editing, Formal Analysis. SH: Data curation, Writing – review & editing, Investigation. SD: Writing – review & editing, Investigation, Data curation. BZ: Supervision, Writing – review & editing, Methodology. DW: Methodology, Supervision, Writing – review & editing.
Funding
The author(s) declare that no financial support was received for the research, and/or publication of this article.
Acknowledgments
All authors sincerely acknowledge all pediatric participants and their guardians for their contribution to this research.
Conflict of interest
Authors RX and W-GC were employed by Beijing BFR Gene Diagnostics Co., Ltd.
The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The author(s) declare that no Generative AI was used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fimmu.2025.1660861/full#supplementary-material
References
1. Stem Cell Application Group CSoH and Chinese Medical Association. Chinese consensus of allogeneic hematopoietic stem cell transplantation for hematological disease (III) -acute graft-versus-host disease (2020). Zhonghua xue ye xue za zhi = Zhonghua xueyexue zazhi. (2020) 41:529–36. doi: 10.3760/cma.j.issn.0253-2727.2020.07.001
2. Chen S and Zeiser R. Novel biomarkers for outcome after allogeneic hematopoietic stem cell transplantation. Front Immunol. (2020) 11:1854. doi: 10.3389/fimmu.2020.01854
3. Balakrishnan B, Kulkarni UP, Pai AA, Illangeswaran RSS, Mohanan E, Mathews V, et al. Biomarkers for early complications post hematopoietic cell transplantation: Insights and challenges. Front Immunol. (2023) 14:1100306. doi: 10.3389/fimmu.2023.1100306
4. Paczesny S. Biomarkers for posttransplantation outcomes. Blood. (2018) 131:2193–204. doi: 10.1182/blood-2018-02-791509
5. Adom D, Rowan C, Adeniyan T, Yang J, and Paczesny S. Biomarkers for allogeneic HCT outcomes. Front Immunol. (2020) 11:673. doi: 10.3389/fimmu.2020.00673
6. Choi SW, Kitko CL, Braun T, Paczesny S, Yanik G, Mineishi S, et al. Change in plasma tumor necrosis factor receptor 1 levels in the first week after myeloablative allogeneic transplantation correlates with severity and incidence of GVHD and survival. Blood. (2008) 112:1539–42. doi: 10.1182/blood-2008-02-138867
7. Ferrara JLM, Harris AC, Greenson JK, Braun TM, Holler E, Teshima T, et al. Regenerating islet-derived 3-alpha is a biomarker of gastrointestinal graft-versus-host disease. Blood. (2011) 118:6702–8. doi: 10.1182/blood-2011-08-375006
8. Lugt MTV, Braun TM, Hanash S, Ritz J, Ho VT, Antin JH, et al. ST2 as a marker for risk of therapy-resistant graft-versus-host disease and death. New Engl J Med. (2013) 369:529–39. doi: 10.1056/NEJMoa1213299
9. Paczesny S, Krijanovski OI, Braun TM, Choi SW, Clouthier SG, Kuick R, et al. A biomarker panel for acute graft-versus-host disease. Blood. (2009) 113:273–8. doi: 10.1182/blood-2008-07-167098
10. Levine JE, Braun TM, Harris AC, Holler E, Taylor A, Miller H, et al. A prognostic score for acute graft-versus-host disease based on biomarkers: a multicentre study. Lancet Haematology. (2015) 2:e21–9. doi: 10.1016/s2352-3026(14)00035-0
11. McDonald GB, Tabellini L, Storer BE, Lawler RL, Martin PJ, and Hansen JA. Plasma biomarkers of acute GVHD and nonrelapse mortality: predictive value of measurements before GVHD onset and treatment. Blood. (2015) 126:113–20. doi: 10.1182/blood-2015-03-636753
12. Hartwell MJ, Özbek U, Holler E, Renteria AS, Major-Monfried H, Reddy P, et al. An early-biomarker algorithm predicts lethal graft-versus-host disease and survival. JCI Insight. (2017) 2:e89798. doi: 10.1172/jci.insight.89798
13. Asteris PG, Gandomi AH, Armaghani DJ, Mohammed AS, Bousiou Z, Batsis I, et al. Pre-transplant and transplant parameters predict long-term survival after hematopoietic cell transplantation using machine learning. Transpl Immunol. (2025) 90:102211. doi: 10.1016/j.trim.2025.102211
14. Verbeek AB, von Asmuth EGJ, van den Akker EB, Jansen-Hoogendijk AM, Schilham MW, Lankester AC, et al. Pre-transplant inflammation and its associations with acute GvHD and mortality in pediatric allogeneic hematopoietic stem cell transplantation patients. Bone Marrow Transplant. (2025) 60:948–55. doi: 10.1038/s41409-025-02583-5
15. Pidala J, Hamadani M, Dawson P, Martens M, Alousi AM, Jagasia M, et al. Randomized multicenter trial of sirolimus vs prednisone as initial therapy for standard-risk acute GVHD: the BMT CTN 1501 trial. Blood. (2020) 135:97–107. doi: 10.1182/blood.2019003125
16. Hotta M, Satake A, Yoshimura H, Fujita S, Katayama Y, Ota S, et al. Elevation of early plasma biomarkers in patients with clinical risk factors predicts increased nonrelapse mortality after allogeneic hematopoietic stem cell transplantation. Transplant Cell Ther. (2021) 27:660.e1–.e8. doi: 10.1016/j.jtct.2021.04.025
17. Robin M, Porcher R, Michonneau D, Taurines L, FSd F, Xhaard A, et al. Prospective external validation of biomarkers to predict acute graft-versus-host disease severity. Blood advances. (2022) 6:4763–72. doi: 10.1182/bloodadvances.2022007477
18. Öztürk G, Bayrakoğlu D, Haskoloğlu Ş, Baskın K, Deveci N, İnce E, et al. ST2 and Reg3α: Can they predict aGvHD, steroid refractoriness and transplant-related mortality in pediatric patients after HSCT? Hematology transfusion Cell Ther. (2024) 46 Suppl 6:S129–S35. doi: 10.1016/j.htct.2024.02.026
19. Harris AC, Young R, Devine S, Hogan WJ, Ayuk F, Bunworasate U, et al. International, multicenter standardization of acute graft-versus-host disease clinical data collection: A report from the mount sinai acute GVHD international consortium. Biol Blood marrow transplantation: J Am Soc Blood Marrow Transplantation. (2016) 22:4–10. doi: 10.1016/j.bbmt.2015.09.001
20. Asteris PG, Skentou AD, Bardhan A, Samui P, and Pilakoutas K. Predicting concrete compressive strength using hybrid ensembling of surrogate machine learning models. Cement Concrete Res. (2021) 145. doi: 10.1016/j.cemconres.2021.106449
21. Armaghani DJ and Asteris PG. A comparative study of ANN and ANFIS models for the prediction of cement-based mortar materials compressive strength. Neural Computing Applications. (2020) 33:4501–32. doi: 10.1007/s00521-020-05244-4
22. Kerr KF, Brown MD, Zhu K, and Janes H. Assessing the clinical impact of risk prediction models with decision curves: guidance for correct interpretation and appropriate use. J Clin oncology: Off J Am Soc Clin Oncol. (2016) 34:2534–40. doi: 10.1200/jco.2015.65.5654
23. Matsumura A, Miyazaki T, Tachibana T, Ando T, Koyama M, Koyama S, et al. Predictive values of early suppression of tumorigenicity 2 for acute GVHD and transplant-related complications after allogeneic stem cell transplantation: prospective observational study. Turkish J haematology: Off J Turkish Soc Haematology. (2020) 37:20–9. doi: 10.4274/tjh.galenos.2019.2019.0139
24. Verma K, Croft W, Greenwood D, Stephens C, Malladi R, Nunnick J, et al. Early inflammatory markers as prognostic indicators following allogeneic stem cell transplantation. Front Immunol. (2023) 14:1332777. doi: 10.3389/fimmu.2023.1332777
25. Qayed M, Kapoor U, Gillespie S, Westbrook A, Aguayo-Hiraldo P, Ayuk FA, et al. A validated risk stratification that incorporates MAGIC biomarkers predicts long-term outcomes in pediatric patients with acute GVHD. Transplant Cell Ther. (2024) 30:603. doi: 10.1016/j.jtct.2024.03.022
26. Wu T and Lu DP. Unmanipulated haploidentical blood and marrow transplantation: where we are. Hong Kong Med J = Xianggang yi xue za zhi. (2009) 15:27–30.
27. Andolina JR, Wang YC, Ji L, Freyer DR, Levine JE, Pulsipher MA, et al. Adolescent and young adult (AYA) versus pediatric patients with acute leukemia have a significantly increased risk of acute GVHD following unrelated donor (URD) stem cell transplantation (SCT): the Children’s Oncology Group experience. Bone Marrow Transplant. (2022) 57:445–52. doi: 10.1038/s41409-021-01558-6
28. Qayed M, Wang T, Hemmer MT, Spellman S, Arora M, Couriel D, et al. Influence of age on acute and chronic GVHD in children undergoing HLA-identical sibling bone marrow transplantation for acute leukemia: implications for prophylaxis. Biol Blood Marrow Transplant. (2018) 24:521–8. doi: 10.1016/j.bbmt.2017.11.004
29. Toubai T, Sun Y, and Reddy P. GVHD pathophysiology: is acute different from chronic? Best Pract Res Clin Haematol. (2008) 21:101–17. doi: 10.1016/j.beha.2008.02.005
30. Khan D and Ansar Ahmed S. The immune system is a natural target for estrogen action: opposing effects of estrogen in two prototypical autoimmune diseases. Front Immunol. (2015) 6:635. doi: 10.3389/fimmu.2015.00635
31. Laffont S, Seillet C, and Guéry J-C. Estrogen receptor-dependent regulation of dendritic cell development and function. Front Immunol. (2017) 8:108. doi: 10.3389/fimmu.2017.00108
32. Palmer DB. The effect of age on thymic function. Front Immunol. (2013) 4:316. doi: 10.3389/fimmu.2013.00316
33. Asteris PG, Gavriilaki E, Kampaktsis PN, Gandomi AH, Armaghani DJ, Tsoukalas MZ, et al. Revealing the nature of cardiovascular disease using DERGA, a novel data ensemble refinement greedy algorithm. Int J Cardiol. (2024) 412:132339. doi: 10.1016/j.ijcard.2024.132339
34. Asteris PG, Gandomi AH, Armaghani DJ, Kokoris S, Papandreadi AT, Roumelioti A, et al. Prognosis of COVID-19 severity using DERGA, a novel machine learning algorithm. Eur J Internal Med. (2024) 125:67–73. doi: 10.1016/j.ejim.2024.02.037
Keywords: acute graft-versus-host disease, allogeneic hematopoietic stem cell transplantation, non-relapse mortality, MAGIC algorithm, pediatric, risk stratification
Citation: Song N, Xiong H, Xu R, Cheng W-G, He S, Deng S, Zhang B and Wang D (2025) Age-enhanced MAGIC algorithm predicts mortality in pediatric aGVHD: a multicenter study. Front. Immunol. 16:1660861. doi: 10.3389/fimmu.2025.1660861
Received: 07 July 2025; Accepted: 25 August 2025;
Published: 12 September 2025.
Edited by:
Tomomi Toubai, Yamagata University, JapanReviewed by:
Daniel Peltier, University of Michigan, United StatesPanagiotis G. Asteris, School of Pedagogical and Technological Education, Greece
Copyright © 2025 Song, Xiong, Xu, Cheng, He, Deng, Zhang and Wang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Benshan Zhang, emhiczMzM0BzaW5hLmNvbQ==; Dao Wang, ZGVhaTMxNUAxNjMuY29t
†These authors have contributed equally to this work