The future is in the background: background EEG patterns, not acute seizures, predict epilepsy and neurodevelopmental outcomes in neonatal HIE

Woodward, Kristine E.; de Jesus, Pauline; Amador, Kimberly; Mouches, Pauline; Braun, Marvin; Mohammad, Khorshid; Forkert, Nils D.; Esser, Michael J.

doi:10.3389/fped.2025.1560760

ORIGINAL RESEARCH article

Front. Pediatr., 29 September 2025

Sec. Neonatology

Volume 13 - 2025 | https://doi.org/10.3389/fped.2025.1560760

This article is part of the Research Topic: Evaluating Efficacy and Outcomes in Neonatal HIE Treatment: A Global Perspective


Kristine E. Woodward¹*

Pauline de Jesus²

Kimberly Amador³

Pauline Mouches³

Marvin Braun¹

Khorshid Mohammad¹

Nils D. Forkert²,³,⁴

Michael J. Esser¹,²

¹Department of Pediatrics, Cummings School of Medicine, University of Calgary, Calgary, AB, Canada
²Department of Clinical Neurosciences, Cummings School of Medicine, University of Calgary, Calgary, AB, Canada
³Department of Radiology, Cummings School of Medicine, University of Calgary, Calgary, AB, Canada
⁴Department of Electrical and Software Engineering, Schulich School of Engineering, University of Calgary, Calgary, AB, Canada

Background: Hypoxic ischemic encephalopathy (HIE) is the most common neurologic emergency in the neonatal population, with a broad spectrum of potential neurodevelopmental outcomes. Additionally, HIE is the most common cause of seizures during the acute neonatal period. Unfortunately, predicting neurodevelopmental outcomes and epilepsy risk is difficult in this population, and seizure burden during the acute period has not consistently been correlated with outcomes in prior studies. We aimed to examine EEG background data to determine whether there is a relationship between background abnormalities, neurodevelopmental outcomes, and epilepsy risk, and whether this information is more informative for predicting outcomes compared to other clinical data points.

Methods: Patients were retrospectively recruited from level 3 Neonatal Intensive Care Units (NICUs) in Calgary, Alberta, from 2014 to 2020. All patients who met the criteria for therapeutic hypothermia after being classified as at risk for HIE were included in the study. Clinical information captured included measures from clinical examination, blood work, MRI (day 3–5, scored using the Barkovich scoring system) and medications. Continuous video EEG (cvEEG) recordings were separated into day 1, 2, and 3, and separate classification systems were used for background and ictal findings. Neurodevelopmental follow-up was completed at two years of age, and patients were also categorized as having no epilepsy, or either well-controlled or refractory epilepsy. Poisson regression models and relative risk were used to compare background and ictal scores to long-term neurodevelopmental outcomes and future epilepsy risk. Three supervised learning algorithms were trained to predict neurodevelopmental outcomes based on clinical factors.

Results: Two hundred and six patients were eligible for the study. Among neonates with seizures, only 18% developed epilepsy, while 52% of those with severely abnormal EEG background patterns did. Total ictal burden was not significantly associated with epilepsy at follow up, and no antiseizure medications were significant predictors. In contrast, EEG background score was strongly associated with epilepsy risk (adjusted β = 2.75, p = 0.002), with severely abnormal backgrounds conferring significantly increased risk (37.5% vs. 5.2%, RR = 7.22, 95% CI: 3.09–16.88). Similarly, ictal burden did not predict poor neurodevelopmental outcome or death, whereas background score was a strong predictor (adjusted β = 1.74, p < 0.001; RR = 2.44, 95% CI: 1.70–3.50). Machine learning models identified background features as more predictive than ictal scores, with XGBoost achieving the best classification performance (accuracy 0.724) and random forest yielding the highest AUC (0.751).

Conclusions: In our cohort, EEG background patterns outperformed ictal burden in predicting both neurodevelopmental outcomes and future epilepsy risk. Although background patterns are not directly modifiable, they provide powerful, early markers of brain injury severity, offering clinicians a valuable tool for prognostication and family counseling at a critical juncture in care.

Introduction

Hypoxic ischemic encephalopathy (HIE) is one of the most common neurologic emergencies in the neonatal population, occurring in 1–8 per 1,000 births worldwide (1). The spectrum of outcomes is broad, ranging from normal neurodevelopment to death. Developmental delays are common and can affect gross and fine motor skills, language and cognitive function, and social skills. Additionally, previous research suggests approximately 10% of patients develop epilepsy, many of whom are refractory to anti-seizure medications (2). Despite advances in neuromonitoring and targeted treatments during the acute phase, predicting patient outcomes to counsel families is difficult. Many studies have investigated potential predictors of long-term neurodevelopmental outcomes in HIE, with common variables being clinical examination (e.g., Sarnat score), specific laboratory measures (e.g., cord blood gas, lactate), neuroimaging findings (specifically MRI), and electroencephalography (EEG) tracings. With more widespread access to continuous EEG monitoring, research has expanded towards characterizing seizure burden and temporal EEG evolution in neonatal HIE, with many studies attempting to use these findings to help predict developmental outcomes. While some studies have described EEG characteristics and outcomes (3, 4), some with limited associations (5), others have identified promising early EEG predictors (6–11). The methodology has varied in using continuous EEG as a predictive marker, including calculating total vs. hourly seizure burden, characterizing EEG patterns during the rewarming period only, separating ictal vs. background EEG features, and using spot EEG analysis vs. averaging more prolonged periods (6–11). Until recently, most studies using quantitative EEG measures were small and did not include additional clinical variables in predictive models.

In this study, we incorporated neurophysiological and clinical data from a large cohort of patients with neonatal HIE using machine learning models, to evaluate the predictive ability for long-term neurodevelopmental outcomes, including future epilepsy risk. In particular, we examined specific EEG markers of background and ictal activity to explore the relationship with these outcomes. Understanding the relationship between clinical markers and outcomes affords the opportunity to improve acute clinical decision making and better guide prognostic discussions with families.

Materials and methods

Patient cohort

Patients were retrospectively recruited from level 3 Neonatal Intensive Care Units (NICUs) in Calgary, Alberta from 2014 to 2020. All patients who met the criteria for therapeutic hypothermia (TH) after being classified as at risk for hypoxic ischemic encephalopathy (HIE) were included in the study. At our institution, initiation of therapeutic hypothermia requires first that babies are ≥35 weeks gestation and ≤6 h old, and subsequently meet both criteria A and B defined as follows: (A) umbilical cord or first-hour arterial gas pH ≤ 7.0 or base excess ≤ −16 (mmol/L), or Apgar score ≤5 at 10 min, or ongoing need for respiratory support at 10 min of birth; AND (B) evidence of moderate to severe encephalopathy, demonstrated by the presence of seizures or at least one sign in three or more of six major categories (Sarnat Score: level of consciousness, spontaneous activity, posture, tone, primitive reflexes, autonomic system) (12). Additionally, patients were excluded from our study if they were moribund or had any major congenital/genetic abnormalities for which no further treatment was planned, severe intrauterine growth restriction (IUGR), significant coagulopathy, or severe intracranial bleeding (12). Patients were also excluded if electronic medical records were not accessible to capture the variables listed below. Research was conducted in accordance with institutional requirements and policies (IRISS University of Calgary #REB15-1249).
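The inclusion logic described above can be read as a simple predicate. The following Python sketch is illustrative only and is not the authors' code: the `NeonateAssessment` class, its field names, and the function names are hypothetical, and the study exclusions (e.g., major congenital abnormalities, severe IUGR) are deliberately not modelled.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class NeonateAssessment:
    """Hypothetical record of the measurements used by the cooling criteria."""
    gestational_age_weeks: float
    age_hours: float
    arterial_ph: Optional[float]           # umbilical cord or first-hour arterial gas
    base_excess_mmol_l: Optional[float]
    apgar_10_min: Optional[int]
    respiratory_support_at_10_min: bool
    seizures_present: bool
    abnormal_sarnat_categories: int        # number of the six categories with a sign


def meets_criterion_a(n: NeonateAssessment) -> bool:
    """Criterion A: any one marker of perinatal compromise."""
    return (
        (n.arterial_ph is not None and n.arterial_ph <= 7.0)
        or (n.base_excess_mmol_l is not None and n.base_excess_mmol_l <= -16)
        or (n.apgar_10_min is not None and n.apgar_10_min <= 5)
        or n.respiratory_support_at_10_min
    )


def meets_criterion_b(n: NeonateAssessment) -> bool:
    """Criterion B: seizures, or signs in at least three of six Sarnat categories."""
    return n.seizures_present or n.abnormal_sarnat_categories >= 3


def eligible_for_hypothermia(n: NeonateAssessment) -> bool:
    """Prerequisites (>=35 weeks, <=6 h old) AND criterion A AND criterion B."""
    return (
        n.gestational_age_weeks >= 35
        and n.age_hours <= 6
        and meets_criterion_a(n)
        and meets_criterion_b(n)
    )
```

Missing measurements are represented as `None` and simply do not satisfy their sub-criterion, which is one possible design choice rather than something stated in the protocol.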

Clinical data

Babies categorized as having HIE were treated with therapeutic hypothermia using whole-body cooling blankets with a built-in thermoregulator (CritiCool®) that maintained a temperature of 33.5 degrees Celsius. Continuous EEG was recorded for the duration of TH (∼72 h) and until the babies were rewarmed to physiological normal temperature (∼6 h).

Clinical information was captured for each patient, including referral centre (rural, urban non-cooling centre, urban cooling centre), gestational age at birth, birth weight, APGAR score at 1, 5, and 10 min, Sarnat score at admission and discharge (13), cord arterial pH, cord arterial base excess, lactate at one hour of age, anti-seizure medications (ASMs) administered acutely (e.g., levetiracetam, phenobarbital, fosphenytoin), and if continued upon discharge, pain/sedative medications (morphine, dexmedetomidine, fentanyl) administered, MRI (day 3–5), EEG (72 h recording), length of stay, a diagnosis of epilepsy at out-patient follow-up, and neurodevelopmental follow-up assessments as described below.

Magnetic resonance imaging scans (1.5 T or 3 T Siemens MR Scanner) were graded based on the combined Barkovich basal ganglia/watershed scoring system by a neuroradiologist and pediatric neurologist using T1- and T2-weighted images (14). EEGs were scored by two independent neurophysiologists (KW and MB). EEG data capture (Natus® NeuroWorks®, restricted 10–20 system using nine electrodes and designated neonatal montage) was separated into day 1 (from initiation of recording to 24 h since cooling onset), day 2 (24 to 48 h of cooling), and day 3 (48 to 72 h of cooling). Background scores and ictal scores were analyzed and calculated separately as follows using the American Clinical Neurophysiology Society Standardized EEG guidelines for Neonates (15). For background scores, a score of 0 indicated normal continuity, whereby there was uninterrupted electrical activity with <2 s of voltage attenuation <25 µV. A score of 1 indicated abnormal excessive discontinuity, where the interburst interval (IBI) was prolonged or voltage depressed (for term, longer than 6 s and <25 µV). Severely abnormal background (score of 2) indicated invariant, abnormally composed EEG bursts (or no bursting) with low voltage <5 µV and no normal electrographic elements within the bursts. Additionally, seizure burden was calculated for each patient during day 1, 2 and 3 for both the entire 24 h, as well as the highest 1 h seizure burden period during that day (using a sliding-window technique). Total ictal burden was calculated for the entire recording for each patient (as a continuous variable). When specified, the highest 1 h seizure burden was used to provide an “ictal score” for each day, whereby 0 = no seizures, 1 = seizures but not meeting criteria for status epilepticus, and 2 = status epilepticus (≥30 min in 1 h).
Total background score was calculated by adding each daily score for a score out of 6 (i.e., worst score would be severe suppression on day 1, 2 and 3 = 2 + 2 + 2 = 6, and best score would be 0, equating a normal background for all three days), and total ictal score was calculated by adding each daily score for a score out of 6 (i.e., worst score would be status epilepticus on day 1, 2, and 3 = 2 + 2 + 2 = 6, and best score would be 0, indicating no seizures). Neurodevelopmental follow-up was completed at approximately 24 months using the Ages and Stages Assessment (16). Neurodevelopmental impairment was characterized as ≥2 standard deviations outside the normal range in any domain.
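As a minimal sketch of the scoring scheme described above (not the authors' implementation), the following Python derives a daily ictal score from the highest 1 h seizure burden and sums daily scores into a total out of 6. It assumes, purely for illustration, that seizure activity is available as one boolean per minute of recording; the function names are hypothetical.

```python
from typing import Sequence

WINDOW_MIN = 60            # length of the sliding window, in minutes
STATUS_THRESHOLD_MIN = 30  # >=30 min of seizure within 1 h = status epilepticus


def max_hourly_burden(seizure_minutes: Sequence[bool]) -> int:
    """Largest number of seizure minutes found in any 60-minute sliding window."""
    n = len(seizure_minutes)
    if n == 0:
        return 0
    window = sum(seizure_minutes[:WINDOW_MIN])
    best = window
    for i in range(WINDOW_MIN, n):
        # Slide by one minute: add the entering minute, drop the leaving one.
        window += seizure_minutes[i] - seizure_minutes[i - WINDOW_MIN]
        best = max(best, window)
    return best


def daily_ictal_score(seizure_minutes: Sequence[bool]) -> int:
    """0 = no seizures, 1 = seizures below the status threshold, 2 = status."""
    burden = max_hourly_burden(seizure_minutes)
    if burden == 0:
        return 0
    return 2 if burden >= STATUS_THRESHOLD_MIN else 1


def total_score(daily_scores: Sequence[int]) -> int:
    """Sum of the three daily scores (each 0-2), giving a total from 0 to 6."""
    if len(daily_scores) != 3 or any(s not in (0, 1, 2) for s in daily_scores):
        raise ValueError("expected three daily scores, each 0, 1 or 2")
    return sum(daily_scores)


def is_severely_abnormal_background(total_background_score: int) -> bool:
    """Grouping used later in the analysis: total background score of 5 or 6."""
    return total_background_score >= 5
```

The same `total_score` helper applies to background and ictal scores, since both are sums of three daily values between 0 and 2.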

Statistical analysis

Each feature was compared to neurodevelopmental outcomes using a Mann–Whitney U-test, and Benjamini–Hochberg correction for false discovery rate. We used Poisson regression models with restricted cubic splines to evaluate the relationship between total EEG background score and neurodevelopmental outcome, as well as future epilepsy risk. The primary outcome was binary (poor vs. good neurodevelopmental outcome, or epilepsy vs. no epilepsy) and the main predictor was total EEG background score. Splines were used to flexibly model non-linear relationships without assuming a specific parametric form. We included both an unadjusted model (including only the spline-transformed EEG background score as a predictor) and an adjusted model [including binary indicators for medication exposure (dexmedetomidine, morphine, fentanyl, levetiracetam, phenobarbital, fosphenytoin)]. Poisson models were fit using the Generalized Linear Model framework with a log link function. Model fit was assessed using pseudo R² statistics (Cragg-Uhler), and 95% confidence intervals were generated for all predictions. Predicted probabilities of poor neurodevelopmental outcome, or epilepsy, were plotted against EEG background scores. Based on these results, patients were split into two groups: those with “severely abnormal EEG background scores (total score of 5 or 6)” and those with “mildly/moderately abnormal or normal EEG background scores (total score of 0–4)”. Relative risk, using a 95% confidence interval, was calculated to determine risk of future epilepsy. The same was used to calculate risk of poor neurodevelopmental outcomes or death.
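For readers unfamiliar with the false discovery rate correction mentioned above, the Benjamini–Hochberg step-up procedure can be sketched in a few lines of Python. This is a generic illustration using only the standard library, not the code used in the study; the function name is hypothetical.

```python
from typing import List, Sequence


def benjamini_hochberg(p_values: Sequence[float]) -> List[float]:
    """Return Benjamini-Hochberg adjusted p-values in the original order."""
    m = len(p_values)
    # Indices of the p-values sorted from smallest to largest.
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, p_values[idx] * m / rank)
        adjusted[idx] = running_min
    return adjusted
```

A feature would then be reported as significant when its adjusted p-value falls below the chosen threshold (commonly 0.05).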

Ictal scores were also compared to both neurodevelopmental follow up and future epilepsy risk using Poisson regression models with restricted cubic splines, and relative risk was determined as described above.
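The relative risk calculation referred to above can be sketched as follows. This is a hedged illustration rather than the authors' code: it uses the standard log-scale (Katz) confidence interval, and, as an assumption, handles a zero cell by adding 0.5 to each event count and 1 to each group size; the article does not specify which continuity correction was applied. All counts in any usage are hypothetical.

```python
import math
from statistics import NormalDist
from typing import Tuple


def relative_risk(
    events_exposed: int,
    n_exposed: int,
    events_unexposed: int,
    n_unexposed: int,
    confidence: float = 0.95,
) -> Tuple[float, float, float]:
    """Return (RR, lower, upper) using a log-scale confidence interval."""
    a, n1 = float(events_exposed), float(n_exposed)
    c, n2 = float(events_unexposed), float(n_unexposed)
    if a == 0 or c == 0:
        # Assumed continuity correction so that the ratio and SE are defined.
        a, c = a + 0.5, c + 0.5
        n1, n2 = n1 + 1.0, n2 + 1.0
    rr = (a / n1) / (c / n2)
    # Standard error of ln(RR).
    se = math.sqrt(1 / a - 1 / n1 + 1 / c - 1 / n2)
    z = NormalDist().inv_cdf(0.5 + confidence / 2)
    return rr, rr * math.exp(-z * se), rr * math.exp(z * se)
```

An interval that excludes 1 corresponds to a statistically significant difference in risk at the chosen confidence level.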

A chi-squared test was used to compare the rate of future epilepsy between patients who were discharged on ASMs and those who were not.
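For a 2 × 2 table such as discharge on ASMs (yes/no) against epilepsy at follow-up (yes/no), the test can be sketched as below. This is a generic Pearson chi-squared test with one degree of freedom and no continuity correction, which is an assumption; the article does not state which variant was used, and the function name is hypothetical.

```python
import math
from typing import Tuple


def chi_squared_2x2(a: int, b: int, c: int, d: int) -> Tuple[float, float]:
    """Pearson chi-squared statistic and p-value for the table [[a, b], [c, d]]."""
    n = a + b + c + d
    row1, row2 = a + b, c + d
    col1, col2 = a + c, b + d
    if 0 in (row1, row2, col1, col2):
        raise ValueError("every row and column must contain at least one count")
    chi2 = n * (a * d - b * c) ** 2 / (row1 * row2 * col1 * col2)
    # For 1 degree of freedom, the upper tail probability is erfc(sqrt(chi2 / 2)).
    p_value = math.erfc(math.sqrt(chi2 / 2))
    return chi2, p_value
```

Here `a` and `b` would be the counts with and without the outcome in the first group, and `c` and `d` the corresponding counts in the second group.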

SPSS and Python were used to conduct all statistical analyses.

Machine learning setup

The machine learning paradigm used in this study consisted of a feature ranking and selection method followed by a classification model. The aim of feature ranking is to sort the available features based on relevance or importance to predict the outcome variable (i.e., good/normal vs. poor/abnormal neurodevelopmental outcome), which is then used for feature selection (17–19).

For this study, the information gain algorithm was used to statistically determine the amount of information that is gained from each feature when predicting neurodevelopmental outcomes. The resulting feature ranking was then used to determine the optimal number of input features for the classifier. This was achieved by removing the least relevant features in an iterative fashion and then retraining and evaluating the classifier in terms of accuracy, thereby reducing dimensionality to improve model performance. Three supervised learning algorithms were used to evaluate predictive performance, including logistic regression, random forest, and extreme gradient boosting, in order to account for imbalance and smaller datasets.
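Information gain for a single feature is the reduction in outcome entropy once the feature value is known. The sketch below illustrates the idea for discrete features with the standard library only; it is not the study's implementation, continuous features would first need to be binned (an assumption not described in the text), and the function names are hypothetical.

```python
import math
from collections import Counter, defaultdict
from typing import Dict, Hashable, List, Sequence


def entropy(labels: Sequence[Hashable]) -> float:
    """Shannon entropy (in bits) of a sequence of class labels."""
    n = len(labels)
    if n == 0:
        return 0.0
    return -sum((k / n) * math.log2(k / n) for k in Counter(labels).values())


def information_gain(feature: Sequence[Hashable], labels: Sequence[Hashable]) -> float:
    """H(outcome) minus the weighted entropy of the outcome within each feature value."""
    groups = defaultdict(list)
    for value, label in zip(feature, labels):
        groups[value].append(label)
    n = len(labels)
    conditional = sum(len(g) / n * entropy(g) for g in groups.values())
    return entropy(labels) - conditional


def rank_features(features: Dict[str, Sequence[Hashable]], labels: Sequence[Hashable]) -> List[str]:
    """Feature names sorted from most to least informative."""
    return sorted(features, key=lambda name: information_gain(features[name], labels), reverse=True)
```

Iterative feature elimination then amounts to dropping the last name in this ranking, retraining the classifier, and keeping the feature count that gives the best accuracy.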

The machine learning models were trained to predict good/normal vs. poor/abnormal neurodevelopmental outcomes. All of the clinical features outlined above were initially included in the model and iteratively reduced by removing the lowest-ranked feature. Due to the class imbalance between normal and abnormal neurodevelopmental outcomes, a random under-sampling approach was used, resulting in a perfectly balanced dataset. We repeated this process ten times to reduce any potential bias induced by the random under-sampling approach.
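Random under-sampling, as described above, keeps every case from the smaller class and draws an equally sized random subset from the larger class. A minimal sketch follows, assuming class labels are held in a plain list; the function names and the use of integer seeds to obtain ten different draws are illustrative assumptions.

```python
import random
from typing import Dict, Hashable, List, Sequence


def undersample(labels: Sequence[Hashable], seed: int) -> List[int]:
    """Return sorted sample indices forming a class-balanced subset."""
    rng = random.Random(seed)
    by_class: Dict[Hashable, List[int]] = {}
    for index, label in enumerate(labels):
        by_class.setdefault(label, []).append(index)
    # Every class is reduced to the size of the smallest class.
    n_min = min(len(indices) for indices in by_class.values())
    selected: List[int] = []
    for indices in by_class.values():
        selected.extend(rng.sample(indices, n_min))
    return sorted(selected)


def repeated_undersampling(labels: Sequence[Hashable], n_repeats: int = 10) -> List[List[int]]:
    """One balanced index set per repeat, each drawn with a different seed."""
    return [undersample(labels, seed) for seed in range(n_repeats)]
```

Repeating the draw means each majority-class patient has a chance of contributing to at least one balanced dataset, which is what limits the bias of any single random subset.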

Using the balanced datasets, a 10-fold cross-validation approach was used to quantitatively evaluate the model performance. This means that ten different models were trained for each experiment, each of which randomly selected 90% of the data for training and 10% for testing. The results of each fold were averaged to compute outcome measures, including accuracy (i.e., percent correctly classified), precision (i.e., positive predictive value), recall (i.e., sensitivity), F-measure (i.e., harmonic mean of precision and recall), and area under the curve.
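The evaluation loop can be sketched as follows, under the assumption of a conventional k-fold partition in which every sample is tested exactly once; the classifier itself is omitted, AUC is not computed here, and all function names are hypothetical.

```python
import random
from typing import Dict, Iterator, List, Sequence, Tuple


def k_fold_indices(n_samples: int, k: int = 10, seed: int = 0) -> Iterator[Tuple[List[int], List[int]]]:
    """Yield (train, test) index lists for each of k shuffled folds."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    folds = [indices[i::k] for i in range(k)]
    for i in range(k):
        train = [j for f, fold in enumerate(folds) if f != i for j in fold]
        yield train, folds[i]


def classification_metrics(y_true: Sequence[int], y_pred: Sequence[int], positive: int = 1) -> Dict[str, float]:
    """Accuracy, precision, recall and F-measure for a binary prediction."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    tn = sum(1 for t, p in pairs if t != positive and p != positive)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return {
        "accuracy": (tp + tn) / len(pairs),
        "precision": precision,
        "recall": recall,
        "f1": f1,
    }
```

When metrics are averaged over folds, the mean F-measure is generally not equal to the harmonic mean of the averaged precision and recall, so the three reported values need not satisfy that identity exactly.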

Results

Patient characteristics

Two hundred and six patients were admitted to hospital between 2014 and 2020 and were eligible for the study. Patient information is listed in Table 1, including median, interquartile ranges, minimum, and maximum values for each clinical variable for each outcome group (normal neurodevelopmental outcome vs. poor neurodevelopmental outcome or death). At admission to the NICU, clinical examinations were documented, and patients were classified as mild, moderate, or severe HIE based on the Sarnat classification scale. For comparison, clinical examinations were also documented after rewarming using the Sarnat classification scale. Most patients had lower Sarnat scores post rewarming, and no patients had higher scores.


Table 1. Summary of patient data. Variables are shown in column 1. Columns 2 and 3 depict patients separated into groups of normal neurodevelopmental outcome and poor neurodevelopmental outcome/death, with medians and interquartile ranges in brackets. Column 4 depicts the minimum and maximum score for each variable in the total patient group. Column 5 shows the p-value comparing groups of normal and poor neurodevelopmental outcome/death for each variable using Mann–Whitney U. The last column shows the corrected p-value for multiple comparisons using the Benjamini–Hochberg method.

MRIs were available for 196 patients, with 10 not performed due to patient death prior to imaging. Five patients had MRIs on day 3 immediately preceding death, whereas the remainder were scanned on day 4 or 5. One hundred and thirty-eight patients had normal MRIs (70.4%). Of the remaining patients, 11 had a Barkovich score of 1 (5.6%), 16 had a score of 2 (8.2%), 19 had a score of 3 (9.7%), and 12 had a score of 4 (6.1%).

EEG results were available for 191 patients on all three days. Background and ictal scoring results are shown in Figure 1. Background patterns overall, even in moderate and severe HIE, showed a trend towards normalization from day 1 to 3, with an increasing number of EEGs receiving a score of 0, and a decreasing number of EEGs with a score of 1 or 2. In terms of ictal grading, there were increasingly more patients with a score of 0 (i.e., no ictal activity) over the three days, and only 4% of patients continued to have seizures on day 3. Also of importance, only 1.5% of patients with seizures on day 3 did not have seizures on either day 1 or day 2. Thirty-four percent of patients had seizures on EEG during the first three days of life. In terms of total ictal burden, patients on average had 53 min and 30 s of ictal EEG activity over the three days (range: 0 h–43 h 24 min). For day 1 this was a mean of 0 h 35 min 39 s (range: 0 h–14 h 25 min), for day 2 this was a mean of 0 h 11 min 19 s (range: 0 h–19 h 15 min), and for day 3 this was a mean of 0 h 6 min 3 s (range: 0 h–9 h 44 min). Forty-eight percent of patients received anti-seizure medications; of these, 54% had abnormal movements suspected to be clinical seizures prior to EEG being connected, without any further seizures on EEG. The most frequently used anti-seizure medication was phenobarbital, followed by levetiracetam and then fosphenytoin (further details shown in Figure 2).

Figure 1

Two-part diagram titled A and B, illustrating changes in proportions over three days using horizontal bars. Part A shows categories 0, 1, and 2 with initial values of 47%, 30%, and 23%, progressing to 61%, 21%, and 17%. Part B starts at 75%, 14%, and 12%, ending at 96%, 2%, and 2%. Colored flows connect categories across days, showing the migration of percentages between groups.

Figure 1. Sankey diagram showing (A) change in background EEG score over three days of recording for each patient and (B) change in ictal EEG score over three days of recording for each patient. Score is beside each node, with percentage of total patients with that score shown beside.


Figure 2. Number of patients that received each combination of anti-seizure medications.

From the total cohort, 7.3% of patients died during the acute period in the hospital. An additional 6.3% were lost due to missing follow-up information at 24 months. At follow-up, 37.1% had neurodevelopmental impairment in at least one domain according to the Ages and Stages Assessment, and 62.9% had normal neurodevelopment at 24 months. Patients were separated into good/normal vs. poor/abnormal overall as a binary measure, given the low power in separating them based on each abnormal neurodevelopmental domain or feature.

Sixty-five patients (34%) had EEG confirmed seizures at some point during the acute period, and of these, 12 patients with seizures in the hospital developed epilepsy (18.5%). Five patients without seizures in hospital developed epilepsy (4% of patients without acute seizures). Seventeen patients (8.9%) had epilepsy, and 8 (4.2%) of these patients had refractory epilepsy at follow-up. Of the 17 patients with epilepsy, 8 were discharged from the hospital on anti-seizure medications (47.0% of those with epilepsy at follow up). In total, 29 patients were discharged on anti-seizure medications due to physician guidance (potentially for parental preference), all of whom had seizures while in the hospital. Twelve of the patients with epilepsy at follow-up had seizures in the hospital (70.6% of the patients with epilepsy). See Figure 3 for a pictorial depiction.

Figure 3

Diagram comparison of two Venn diagrams. Diagram A shows three overlapping circles labeled: Seizures in hospital (n=65), Discharged on ASM (n=29), and Epilepsy at follow-up (n=17), with intersection values 21, 8, 4, and 5. Diagram B represents three overlapping circles for Severely abnormal background EEG (n=33), Discharged on ASM (n=29), and Epilepsy at follow-up (n=17), intersecting at 8, 10, 7, and 14.

Figure 3. Weighted Venn diagrams showing number of patients with overlap, having (A) seizures in hospital, being discharged on ASMs, and having epilepsy at follow-up or (B) having a severely abnormal background EEG, being discharged on ASMs, and having epilepsy at follow-up. Numbers in each overlap area indicate number of patients falling into that overlap region. N in brackets depicts total number of patients in that category.

Severely abnormal EEG background classification was given for those babies with a total background score of 5 or 6. There were 33 patients with a background score of 5 or 6 during the 3 days of EEG recording. Of these 33 patients, 17 had epilepsy at follow up (51.5%), 7 of whom were discharged on ASMs. See Figure 3 for a pictorial depiction.

Within our cohort, there were no significant differences in epilepsy prevalence between the groups who were discharged on ASM and those who were not (χ² = 1.68, p = 0.19).

Future epilepsy risk

As mentioned above, only 18.5% of patients with seizures in hospital had epilepsy at follow up. In contrast, 51.5% of patients with severely abnormal EEG background scores had epilepsy at follow up.

Figure 4 depicts the distribution of patients with no epilepsy, well controlled epilepsy, and refractory epilepsy at follow up in groups of patients separated based on total ictal scores while in hospital. As shown, there is no clear trend to suggest a relationship between worse ictal scores and epilepsy. In line with this, using a Poisson regression model with restricted cubic splines there was not a significant association between total ictal burden and epilepsy at follow up (β = −0.0002, p = 0.47). None of the covariates (levetiracetam, phenobarbital, fosphenytoin, morphine, fentanyl, dexmedetomidine) demonstrated statistically significant associations with epilepsy risk. This model explained approximately 2% of the variance in the epilepsy outcome (pseudo R² = 0.0196). This was also analyzed without ASMs as covariates, and in the unadjusted Poisson regression model, total ictal burden was not significantly associated with epilepsy at follow up (β = −0.0002, p = 0.43), with pseudo R² = 0.011 indicating that the ictal burden alone explains approximately 1% of the variability in epilepsy outcomes. Calculating relative risk, patients with seizures in hospital were not more likely to have epilepsy at follow-up compared to those without seizures in hospital (18% vs. 4%, RR = 3.25, 95% CI = 0.759–13.907).

Figure 4

Four horizontal stacked bar charts labeled A, B, C, and D depict patient outcomes based on EEG recordings. Charts A and B classify outcomes as no epilepsy, well-controlled epilepsy, and refractory epilepsy. Charts C and D categorize neurodevelopmental outcomes as good, poor, and deceased. Each bar represents total seizure or background scores over three days, with the percentage of patients in each category shown.

Figure 4. Comparison of patient groups based on (A) total 3-day ictal scores and (B) total 3-day background scores and epilepsy at follow-up, including patients with no epilepsy, those with well controlled epilepsy, and those with refractory epilepsy; and comparison of patient groups based on (C) total 3-day ictal score and (D) total 3-day background scores and outcomes at follow-up, including patients with good neurodevelopmental outcomes, those with poor neurodevelopmental outcomes, and patients who died in hospital.

Figure 4 also shows the number of patients with no epilepsy, well-controlled epilepsy, and refractory epilepsy at follow up in groups of patients based on total background score while in hospital. As can be seen, there is a trend to suggest a relationship between worse background scores and likelihood of having epilepsy. In line with this, using a Poisson regression model with restricted cubic splines (df = 3) we identified a non-linear relationship, with a steep increase in epilepsy probability observed among patients with more severe background abnormalities (Figure 5). This suggests that while mildly abnormal backgrounds display similar low future epilepsy risk, severely abnormal EEGs are particularly predictive of epilepsy (spline 3; β = 3.55, p < 0.001) and the spline-based model explains 34.5% of the variance in epilepsy outcome (pseudo R² = 0.345). Importantly, this association persisted after adjusting for antiseizure medications (β = 2.75, p = 0.002 and pseudo R² = 0.39). Phenobarbital and levetiracetam were independently associated with higher probability of epilepsy, likely due to clinical indication in that they are used in patients at higher clinical risk. No other medications had significant association with epilepsy (fosphenytoin, morphine, fentanyl, dexmedetomidine). Given these findings, patients with severely abnormal background scores (5 or 6) were compared to those with mildly abnormal or normal background scores (0–4), and using relative risk were found to have significantly increased risk of epilepsy at follow-up [51.5% vs. 0%, RR = 2.13 (adjusted by applying a continuity correction given the “0%”), 95% CI = 1.47–3.07].

Figure 5

Graph A shows the predicted probability of epilepsy at follow-up increasing with total EEG background score, with a shaded area indicating the ninety-five percent confidence interval. Graph B displays a similar trend for the predicted probability of poor neurodevelopmental outcome, also showing a shaded confidence interval.

Figure 5. Poisson regression models with restricted cubic splines and 95% confidence intervals for (A) predicted probability of epilepsy at follow-up based on total EEG background scores over 3 days and (B) predicted probability of poor neurodevelopmental outcome or death based on total EEG background scores over 3 days.

Neurodevelopmental outcomes

Mann–Whitney U-test for each feature compared to neurodevelopmental outcome, corrected for multiple comparisons (Benjamini–Hochberg), is listed in Table 1. The only significant features were neurological exam at admission, and background EEG scores.

Average total ictal scores compared with neurodevelopmental outcomes and death are depicted in Figure 4. No clear trend was apparent. The Poisson regression model with restricted cubic splines did not demonstrate a significant relationship between total ictal burden and outcomes (β = 0.0002, p = 0.43, pseudo R² = 0.011). Results did not change taking medications into account as a covariate. When patients with any seizures in hospital were compared to those with none, there was not an increased risk of poor neurodevelopmental outcomes/death (42% vs. 33%, RR = 1.339, 95% CI = 0.911–1.968).

Figure 4 demonstrates the number of patients in each neurodevelopmental outcome category with groups of patients separated based on total background score while in hospital. As shown, there is a trend to suggest a relationship between worse background scores and poor outcomes. Using a Poisson regression model with restricted cubic splines (df = 4), there was a significant association between total background score and neurodevelopmental outcome, using two models (both adjusted and unadjusted for medications) (Figure 5). In the unadjusted model, higher EEG background scores were associated with increased risk of poor neurodevelopmental outcome (β = 1.55, p < 0.001), with the model explaining a moderate amount of outcome variability (pseudo R² = 0.183). In the adjusted model (including medications as covariates), similar findings were seen, with higher splines showing significant associations with outcome (β = 1.74, p < 0.001), with similar pseudo R² values (0.189). Patients with severely abnormal background scores were more likely to have poor neurodevelopmental outcomes than those with good background scores while in hospital (62.2% vs. 25.4%, RR = 2.44, 95% CI = 1.70–3.50).

Predictive modelling

Model performance varied across the three classifiers. XGBoost achieved the highest accuracy (0.724), precision (0.611), recall (0.476), and F1 score (0.519), indicating superior performance in identifying cases with poor outcomes (Figure 6, Table 2). However, random forest had the highest AUC (0.751), suggesting better overall discrimination ability (Figure 6, Table 2). To better illustrate overlap in predictive value across methods, a Venn diagram was constructed comparing the top 10 features selected by each of the three models (Figure 6). Interestingly, arterial pH and birth weight were selected in all three models as important features. Background scores were represented more frequently in overlap sections compared to ictal scores, in line with previous results. Other important features in multiple models included MRI scores, gestational age, arterial base excess and APGAR score at 5 min.

Figure 6

Bar chart and Venn diagram comparison. The bar chart shows AUC values for XGBoost, Random Forest, and Logistic Regression, with Random Forest having the highest mean. The Venn diagram illustrates the top 10 feature overlaps across the three models, highlighting common and unique features such as APGAR scores, arterial pH, and birth weight.

Figure 6. (A) Comparison of the areas under the curve (AUC) of all three machine learning models for predicting good vs. poor neurodevelopmental outcome/death and (B) the top ten features selected for each model, and where they overlapped between models.


Table 2. Model performance across the three supervised learning algorithms for predicting poor neurodevelopmental outcome or death.

Discussion

This large cohort study examined the importance of specific clinical factors in predicting both neurodevelopmental outcomes and future epilepsy risk in patients with neonatal HIE.

Our main findings demonstrated:

1. cvEEG is an important predictor of neurodevelopmental outcomes.

2. cvEEG background scores are stronger than ictal burden at predicting mortality and poor neurodevelopmental outcomes.

3. cvEEG background scores are stronger than ictal burden at predicting epilepsy at follow up.

4. ASM use in hospital and at discharge does not correlate with future epilepsy risk.

Understanding risk factors for poor neurodevelopmental outcomes and epilepsy can help guide acute treatment decisions, aid in early prognostication, and lead to more informed patient care.

Neurodevelopmental outcomes

In our study, neurodevelopmental outcomes and in-hospital mortality were significantly associated with EEG background scores, but not EEG ictal scores. In 2023, in a study nested within the HEAL randomized controlled trial, Glass et al. also reported a significant association between in-hospital EEG background scores and neurodevelopmental outcomes (20). The reproducibility of these results with large numbers, but at a single centre, reinforces the importance of these findings. Figure 4 shows that the highest percentage of deceased patients was in the group with background EEG scores of 5 or 6. Further, while more patients had suppressed backgrounds on day 1 (Figure 1), these improved by day 3. To receive a score of 5 or 6, the background had to remain suppressed or discontinuous for the entirety of the 3-day recording. These results show the added utility of recording EEG for 3 days during the acute period to aid in predicting outcomes, but one could argue that this is predominantly necessary in babies whose EEG background is classified as 'severely abnormal' at the onset, to ensure resolution.

Seizure trends and future epilepsy risk

EEG ictal scores were not associated with neurodevelopmental outcomes or future epilepsy risk. Overall, seizure frequency decreased over the three days of cooling, except in a small percentage of patients (4%) who continued to have seizures on day 3, reinforcing the limited need for maintenance ASMs in patients with HIE. A 2021 study by Glass et al. (21) demonstrated the lack of evidence for continuing ASMs in patients with neonatal seizures after discharge from hospital. Our study is aligned with those findings, as there were no significant differences in rates of epilepsy at follow-up between patients discharged home on ASMs and those who were not. Furthermore, seizures in hospital did not predict epilepsy at follow-up. In fact, 5 patients with epilepsy at follow-up did not have acute seizures during their admission period, reinforcing the difficulty in predicting future risk of epilepsy.

Overall, in our cohort, EEG background scores were more predictive of epilepsy at follow-up than ictal burden. This finding may be explained by the fact that background abnormalities reflect global cerebral injury and neuronal dysfunction; severely abnormal EEG backgrounds, such as discontinuity or burst suppression, indicate widespread cortical damage and impaired synaptic recovery, conditions that promote long-term epileptogenesis. In contrast, ictal burden primarily reflects transient instability in the acute phase, and not necessarily long-term neuronal damage and network reorganization. Therefore, background EEG scores, as suggested by our data, are likely a more stable and prognostically relevant biomarker for epilepsy risk in this population.

Predictive modelling using machine learning

The comparative analysis of machine learning models highlights the clinical promise of data-driven prediction in neonatal neurocritical care. Among the models tested, XGBoost demonstrated the strongest overall classification performance, suggesting higher sensitivity in detecting infants at risk of poor neurodevelopmental outcome or death. This model consistently identified EEG background scores (individual days and total) and electrographic seizures on day 1 as key predictors, aligning well with existing literature (6). Figure 6 illustrates the overlap in predictive value across methods for the top ten features, and highlights how different algorithms capture distinct patterns in the data. This multidimensional perspective reinforces the central importance of early EEG findings in prognostication, while also suggesting complementary contributions from metabolic (i.e., lactate, arterial BE) and clinical (i.e., referral center, birth weight) variables.

Limitations

Despite the inclusion of a relatively large cohort, this study was affected by a class imbalance, with a smaller proportion of patients experiencing poor neurodevelopmental outcomes or death and lower rates of epilepsy at follow up. This imbalance may have limited the initial statistical analyses as well as the predictive performance of the machine learning models, particularly in accurately identifying high-risk patients. Increasing the representation of poor outcomes in future datasets may enhance model calibration and discrimination. A larger sample size would also support stratification of neurodevelopmental outcomes into specific cognitive and behavioral domains, which could reveal domain-specific vulnerabilities (e.g., executive function) when examined in conjunction with long-term follow-up data.
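One common way to partially compensate for class imbalance at training time is to weight each class inversely to its frequency. The sketch below computes such weights with the widely used formula n_samples / (n_classes × class_count). The label counts are hypothetical, the function name is introduced here for illustration, and the source does not state whether any class weighting was applied in this study.

```python
from collections import Counter


def balanced_class_weights(labels):
    """Inverse-frequency weights: n_samples / (n_classes * class_count)."""
    counts = Counter(labels)
    n_samples = len(labels)
    n_classes = len(counts)
    return {cls: n_samples / (n_classes * count) for cls, count in counts.items()}


# HYPOTHETICAL outcome labels (not the study's distribution)
labels = ["good"] * 75 + ["poor"] * 25
weights = balanced_class_weights(labels)
print(weights)
```

With these weights, each misclassified minority-class case contributes more to the training loss, which tends to trade some overall accuracy for better recall of high-risk patients.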

This analysis used neurological examinations from NICU admission, typically within the first 6 h of life, rather than immediate postnatal assessments. Due to documentation constraints, earlier clinical findings at the referring hospital may have influenced the decision to initiate therapeutic hypothermia, and patients may have shown clinical improvement or deterioration during transfer. This limits the utility of the mild, moderate, and severe examination grouping in this dataset.

While anti-seizure medications can transiently alter EEG background features (e.g., increased discontinuity or voltage suppression), prior studies have shown EEG background typically recovers within 4 h of administration (22). Given that EEG background was assessed continuously over 24 h intervals, our analyses likely reflect stable background characteristics rather than short-term medication effects (22). Additionally, ASMs as well as sedative medications were used as covariates in multiple analyses as described above.

While our use of XGBoost and repeated sampling strategies was intended to enhance predictive performance, we acknowledge that these methods do not provide causal estimates of feature effects. If the primary objective were to estimate the independent effect of EEG background or ictal burden, methods such as inverse probability of treatment weighting (IPTW) or marginal structural models (MSMs) would be more appropriate and interpretable. Future work aiming to quantify causal effects should consider these approaches to complement the predictive modeling framework presented here.
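To make the IPTW idea concrete, the following minimal sketch estimates a weighted risk difference for a binary exposure with a single binary confounder, using stratum-specific exposure frequencies as propensity scores. All data are hypothetical, the function and variable names are introduced here for illustration, and a real analysis would use a fitted propensity model with many covariates and would need to check positivity (every stratum must contain both exposed and unexposed patients).

```python
from collections import defaultdict


def iptw_risk_difference(records):
    """records: list of (confounder, exposed, outcome), exposed/outcome in {0, 1}."""
    # Step 1: propensity score = exposure frequency within each confounder stratum
    n_by_stratum = defaultdict(int)
    exposed_by_stratum = defaultdict(int)
    for x, t, _ in records:
        n_by_stratum[x] += 1
        exposed_by_stratum[x] += t
    propensity = {x: exposed_by_stratum[x] / n_by_stratum[x] for x in n_by_stratum}

    # Step 2: weight each patient by the inverse probability of the exposure received
    sums = {0: [0.0, 0.0], 1: [0.0, 0.0]}  # arm -> [weighted outcome sum, weight sum]
    for x, t, y in records:
        p = propensity[x]
        weight = 1 / p if t == 1 else 1 / (1 - p)
        sums[t][0] += weight * y
        sums[t][1] += weight

    # Step 3: difference of weighted outcome means between arms
    return sums[1][0] / sums[1][1] - sums[0][0] / sums[0][1]


def make_group(x, t, n_events, n_total):
    """Build n_total records for one stratum/arm, n_events of them with outcome 1."""
    return [(x, t, 1)] * n_events + [(x, t, 0)] * (n_total - n_events)


# HYPOTHETICAL data: exposure is more common in the high-risk stratum (x = 1)
records = (
    make_group(0, 1, 1, 2) + make_group(0, 0, 2, 8)
    + make_group(1, 1, 6, 8) + make_group(1, 0, 1, 2)
)
naive = 7 / 10 - 3 / 10  # crude risk difference, ignoring the confounder
adjusted = iptw_risk_difference(records)
```

In this toy example the crude risk difference is 0.40, whereas the weighted estimate is 0.25, showing how confounding can inflate an unadjusted association.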

Lastly, our models have not yet undergone external validation, which limits generalizability. Nonetheless, the results are consistent with findings from larger studies, supporting the robustness of the approach.

Conclusions

Neonates with HIE present with a spectrum of clinical and EEG findings, from no seizures to status epilepticus, and from normal to severely abnormal backgrounds. As such, prognostication of short- and long-term outcomes can be challenging. Scoring EEGs using separate measures for background and ictal characteristics may help to better identify patients at risk of in-hospital mortality, poor neurodevelopmental outcomes, and future epilepsy. Results from our study suggest that severely abnormal EEG background scores are significantly associated with outcomes, whereas acute seizures at the time of presentation are not. Characterization of specific EEG patterns aids in understanding clinical progression, guiding treatment decisions, and providing families with earlier prognostication.

Data availability statement

The data analyzed in this study are available upon request. Requests to access these datasets should be directed to the corresponding author, Kristine E. Woodward.

Ethics statement

The studies involving humans were approved by IRISS University of Calgary #REB15-1249. The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation in this study was provided by the participants' legal guardians/next of kin.

Author contributions

KW: Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Writing – original draft, Writing – review & editing. PD: Data curation, Formal analysis, Methodology, Writing – review & editing. KA: Data curation, Formal analysis, Writing – review & editing. PM: Data curation, Formal analysis, Writing – review & editing. MB: Conceptualization, Data curation, Methodology, Writing – review & editing. KM: Conceptualization, Writing – review & editing. NF: Data curation, Formal analysis, Supervision, Writing – review & editing. ME: Conceptualization, Methodology, Supervision, Writing – review & editing.

Funding

The author(s) declare that financial support was received for the research and/or publication of this article. This work was funded by the Alberta Children's Hospital Research Institute.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Generative AI was used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Abbreviations

HIE, hypoxic ischemic encephalopathy; TH, therapeutic hypothermia; cvEEG, continuous video electroencephalography; IBI, inter-burst-interval; IUGR, intrauterine growth restriction.

References

1. Kurinczuk JJ, White-Koning M, Badawi N. Epidemiology of neonatal encephalopathy and hypoxic-ischaemic encephalopathy. Early Hum Dev. (2010) 86(6):329–38. doi: 10.1016/j.earlhumdev.2010.05.010

2. Glass HC, Hong KJ, Rogers EE, Jeremy RJ, Bonifacio SL, Sullivan JE, et al. Risk factors for epilepsy in children with neonatal encephalopathy. Pediatr Res. (2011) 70(5):535–40. doi: 10.1203/PDR.0b013e31822f24c7

3. Lynch NE, Stevenson NJ, Livingstone V, Murphy BP, Rennie JM, Boylan GB. The temporal evolution of electrographic seizure burden in neonatal hypoxic ischemic encephalopathy. Epilepsia. (2012) 53(3):549–57. doi: 10.1111/j.1528-1167.2011.03401.x

4. Lynch NE, Stevenson NJ, Livingstone V, Mathieson S, Murphy BP, Rennie JM, et al. The temporal characteristics of seizures in neonatal hypoxic ischemic encephalopathy treated with hypothermia. Seizure. (2015) 33:60–5. doi: 10.1016/j.seizure.2015.10.007

5. Kwon JM, Guillet R, Shankaran S, Laptook AR, McDonald SA, Ehrenkranz RA, et al. Clinical seizures in neonatal hypoxic-ischemic encephalopathy have no independent impact on neurodevelopmental outcome: secondary analyses of data from the neonatal research network hypothermia trial. J Child Neurol. (2011) 26(3):322–8. doi: 10.1177/0883073810380915

6. Kharoshankaya L, Stevenson NJ, Livingstone V, Murray DM, Murphy BP, Ahearne CE, et al. Seizure burden and neurodevelopmental outcome in neonates with hypoxic-ischemic encephalopathy. Dev Med Child Neurol. (2016) 58(12):1242–8. doi: 10.1111/dmcn.13215

7. Chen Y-J, Chiang M-C, Lin J-J, Chou I-J, Wang Y-S, Kong S-S, et al. Seizures severity during rewarming can predict seizure outcomes of infants with neonatal hypoxic-ischemic encephalopathy following therapeutic hypothermia. Biomed J. (2020) 43(3):285–92. doi: 10.1016/j.bj.2020.06.008

8. Chalak LF, Pappas A, Tan S, Das A, Sánchez PJ, Laptook AR, et al. Association between increased seizures during rewarming after hypothermia for neonatal hypoxic ischemic encephalopathy and abnormal neurodevelopmental outcomes at 2-year follow-up: a nested multisite cohort study. JAMA Neurol. (2021) 78(12):1484–93. doi: 10.1001/jamaneurol.2021.3723

9. Bourel-Ponchel E, Querne L, Flamein F, Ghostine-Ramadan G, Wallois F, Lamblin MD. The prognostic value of neonatal conventional-EEG monitoring in hypoxic-ischemic encephalopathy during therapeutic hypothermia. Dev Med Child Neurol. (2023) 65(1):58–66. doi: 10.1111/dmcn.15302

10. Fitzgerald MP, Massey SL, Fung FW, Kessler SK, Abend NS. High electroencephalographic seizure exposure is associated with unfavorable outcomes in neonates with hypoxic-ischemic encephalopathy. Seizure. (2018) 61:221–6. doi: 10.1016/j.seizure.2018.09.003

11. Murray DM, Boylan GB, Ryan CA, Connolly S. Early EEG findings in hypoxic-ischemic encephalopathy predict outcomes at 2 years. Pediatrics. (2009) 124(3):e459–467. doi: 10.1542/peds.2008-2190

12. Lemyre B, Chau V. Hypothermia for newborns with hypoxic-ischemic encephalopathy. Paediatr Child Health. (2018) 23(4):285–91. doi: 10.1093/pch/pxy028

13. Sarnat HB, Sarnat MS. Neonatal encephalopathy following fetal distress. A clinical and electroencephalographic study. Arch Neurol. (1976) 33(10):696–705. doi: 10.1001/archneur.1976.00500100030012

14. Barkovich AJ, Hajnal BL, Vigneron D, Sola A, Partridge JC, Allen F, et al. Prediction of neuromotor outcome in perinatal asphyxia: evaluation of MR scoring systems. AJNR Am J Neuroradiol. (1998) 19(1):143–9. PMID: 9432172

15. Tsuchida TN, Wusthoff CJ, Shellhaas RA, Abend NS, Hahn CD, Sullivan JE, et al. American clinical neurophysiology society standardized EEG terminology and categorization for the description of continuous EEG monitoring in neonates: report of the American clinical neurophysiology society critical care monitoring committee. J Clin Neurophysiol Off Publ Am Electroencephalogr Soc. (2013) 30(2):161–73. doi: 10.1097/WNP.0b013e3182872b24

16. Singh A, Yeh CJ, Boone Blanchard S. Ages and stages questionnaire: a global screening scale. Bol Med Hosp Infant Mex. (2017) 74(1):5–12. doi: 10.1016/j.bmhimx.2016.07.008

17. Lo Vercio L, Amador K, Bannister JJ, Crites S, Gutierrez A, MacDonald ME, et al. Supervised machine learning tools: a tutorial for clinicians. J Neural Eng. (2020) 17(6):062001. doi: 10.1088/1741-2552/abbff2

18. MacEachern SJ, Forkert ND. Machine learning for precision medicine. Genome. (2021) 64(4):416–25. doi: 10.1139/gen-2020-0131

19. Mooney C, O'Boyle D, Finder M, Hallberg B, Walsh BH, Henshall DC, et al. Predictive modelling of hypoxic ischaemic encephalopathy risk following perinatal asphyxia. Heliyon. (2021) 7(7):e07411. doi: 10.1016/j.heliyon.2021.e07411

20. Glass HC, Numis AL, Comstock BA, Gonzalez FF, Mietzsch U, Bonifacio SL, et al. Association of EEG background and neurodevelopmental outcome in neonates with hypoxic-ischemic encephalopathy receiving hypothermia. Neurology. (2023) 101(22):e2223–33. doi: 10.1212/WNL.0000000000207744

21. Glass HC, Soul JS, Chang T, Wusthoff CJ, Chu CJ, Massey SL, et al. Safety of early discontinuation of antiseizure medication after acute symptomatic neonatal seizures. JAMA Neurol. (2021) 78(7):817–25. doi: 10.1001/jamaneurol.2021.1437

22. Deshpande P, Jain A, McNamara PJ. Effect of phenobarbitone on amplitude-integrated electroencephalography in neonates with hypoxic-ischemic encephalopathy during hypothermia. Neonatology. (2020) 117(6):721–8. doi: 10.1159/000511540

Keywords: neonatal HIE, cEEG = continuous EEG, neurodevelopmental outcome, epilepsy prediction, background EEG, background EEG activity

Citation: Woodward KE, de Jesus P, Amador K, Mouches P, Braun M, Mohammad K, Forkert ND and Esser MJ (2025) The future is in the background: background EEG patterns, not acute seizures, predict epilepsy and neurodevelopmental outcomes in neonatal HIE. Front. Pediatr. 13:1560760. doi: 10.3389/fped.2025.1560760

Received: 14 January 2025; Accepted: 22 August 2025;
Published: 29 September 2025.

Edited by:

Hemmen Sabir, University Hospital Bonn, Germany

Reviewed by:

Elena Pavlidis, Central Hospital of Bolzano, Italy
Enrico Cocchi, University of Bologna, Italy

Copyright: © 2025 Woodward, de Jesus, Amador, Mouches, Braun, Mohammad, Forkert and Esser. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Kristine E. Woodward
