- 1Department of Complex Systems, Institute of Computer Science of the Czech Academy of Sciences, Prague, Czechia
- 2Department of Electrical and Computer Engineering, Aristotle University of Thessaloniki, Thessaloniki, Greece
- 3Medical School, Aristotle University of Thessaloniki, Thessaloniki, Greece
- 4Clinical Research Program, National Institute of Mental Health, Klecany, Czechia
- 5Third Faculty of Medicine, Charles University, Prague, Czechia
Introduction: Major depressive disorder (MDD) remains a leading cause of disability worldwide, with limited objective biomarkers to guide treatment selection and monitor early therapeutic effects. This study examines whether pharmacological and neurostimulation treatments produce distinct alterations in directional brain connectivity patterns within the first week of treatment and whether baseline connectivity patterns differ between treatment responders and nonresponders assessed at 4–6 weeks.
Methods: We perform an information theory-based causality analysis on instantaneous phase time series derived from electroencephalography recordings of 176 MDD patients. Patients received either pharmacological treatment or neurostimulation and were recorded at baseline (visit 1) and one week post-treatment initiation (visit 2). We quantified directional information flow across 19 EEG channels in five frequency bands (: 1.5-3.5Hz, : 4-8Hz, : 8-12Hz, : 12.5-17.5Hz, : 18-25.5Hz) using global FLOW metrics and local INFLOW/OUTFLOW metrics per channel.
Results: Treatment modalities produced distinct frequency-specific alterations in brain causality. Pharmacological treatment significantly increased global and local information transmission in the β2 band from visit 1 to visit 2, with widespread effects across brain regions and larger increases in right hemisphere inflow. Neurostimulation treatment increased information flow primarily in the δ band, with the strongest effects originating from left hemisphere to the whole brain. At baseline, pharmacological nonresponders exhibited significantly higher α band information flow than responders, particularly for right-to-left hemisphere transmission and bilateral inflow. No significant baseline differences emerged between neurostimulation responders and nonresponders, though large effect sizes in δ band metrics suggest findings may achieve significance with larger samples.
Discussion: Early-phase directional brain connectivity analysis reveals that pharmacological and neurostimulation treatments engage distinct neural oscillatory mechanisms within the first week. Elevated baseline right-dominant α-band causal transmission identifies patients unlikely to respond to pharmacological treatment, enabling alternative interventions before weeks of ineffective therapy. These phase-based causality metrics offer promising biomarkers for early treatment stratification and monitoring. The ability to distinguish treatment responders from nonresponders at baseline and detect treatment-specific changes within one week, before clinical response manifestation, suggests these causality measures capture early mechanistic alterations that could inform timely treatment optimization decisions. (EUDRACT Nos. 2005-000826–22 and 2015-001639-19, registered via www.clinicaltrialsregister.eu).
1 Introduction
Major depressive disorder (MDD) is a widespread mental health condition affecting millions of people worldwide (1). It can contribute to various physiological illnesses characterized by enduring unhappiness, loss of interest in activities, and a range of physical and cognitive symptoms. Despite the development of novel treatment strategies for MDD, a critical clinical challenge remains: treatment selection is largely trial-and-error, with patients often enduring weeks of ineffective therapy before alternative approaches are considered. This prolonged period of treatment uncertainty stems from the absence of reliable early biomarkers to predict which patients will respond to specific treatment modalities and to monitor early therapeutic effects before clinical symptom changes manifest (2, 3). The diagnostic criteria for MDD predominantly focus on behavioral aspects and depend on self-reported symptoms, providing limited insight into underlying neurophysiological mechanisms that might guide treatment decisions. There is an urgent necessity for objective biomarkers of MDD to facilitate personalized treatment selection and enable early detection of therapeutic response. Determining efficient, non-invasive diagnostic and therapeutic approaches for MDD poses a significant challenge in clinical neuroscience. Herein two critical clinical questions are addressed: 1) Do pharmacological and neurostimulation treatments produce distinct, detectable alterations in brain connectivity patterns within the first week of treatment before clinical response manifests? 2) Can baseline directional connectivity patterns distinguish patients who will ultimately respond versus not respond to specific treatment modalities assessed after 4–6 weeks through patient self-reporting?
Electroencephalography (EEG) is a widely used, non-invasive neuroimaging technique that measures cerebral electrical activity with a high temporal resolution in the millisecond range, providing benefits over alternative brain imaging modalities. Numerous studies have examined changes in brain functional connectivity, i.e., the pattern of statistical dependencies between remote neurophysiological processes (4), in individuals with MDD, utilizing EEG data (5). However, most EEG analyses typically involves extraction of functional connectivity measures such as correlation, coherence or cordance derived from cerebral signals (6–11). These measures quantify the strength of relationships between brain regions but cannot determine the directionality of information flow. This limitation is significant because understanding which brain regions influence others may be critical for differentiating treatment mechanisms and identifying patients likely to respond to specific interventions. The data obtained from multiple scalp electrodes is analyzed across different frequency bands since these bands are linked to diverse neurophysiological processes and cognitive states. Connectivity patterns within and across these bands provide a more comprehensive and functionally relevant picture of brain dynamics.
Causality analysis addresses this gap by going beyond correlation or coherence to identify cause-and-effect relationships and directional influence patterns in brain networks. In MDD, several studies have already applied directional connectivity methods, including time- and frequency-domain Granger causality, to characterize abnormal information flow in cortico-limbic and fronto-parietal circuits. These approaches have mainly relied on models (typically linear or pre-specified non-linear) and have only rarely been extended to early treatment-related changes in large clinical cohorts. Comparatively little work has used information theory-based causality estimators (like transfer entropy) or estimators operating on phase dynamics to study MDD with EEG, particularly in multivariate set-ups and in the context of treatment prediction and early neurophysiological changes (12–15). The information-theoretic generalization of the Granger causality concept (16) provides a solid mathematical basis for several reliable causality inference methods (17, 18), including the phase dynamics adaptation used in this study (19). Analyzing causality in multi-channel EEG signals might enhance treatment stratification by revealing how different therapies alter the direction and intensity of causal influences among brain regions, thereby illuminating fundamental therapeutic mechanisms.
In this study, we perform a multivariate causality analysis on EEG data recorded from participants diagnosed with MDD and treated with medication or neurostimulation (specifically, low-frequency repetitive transcranial magnetic stimulation [rTMS] or transcranial direct current stimulation [tDCS] targeting the dorsolateral prefrontal cortex). While prior literature has established biomarkers based on non-directional measures, it remains unclear whether directional phase-based causality exhibits similar patterns. To mitigate this gap without bias, we adopt a data-driven approach to address the two aforementioned clinical questions, i.e., of detecting treatment-specific alterations in brain connectivity and of treatment response stratification from baseline connectivity patterns. Early identification of treatment-specific neurophysiological signatures and response-predictive patterns could be particularly advantageous in the initial phase of treatment, as it may facilitate prompt identification of patients unlikely to benefit from their current therapy, enabling timely transition to alternative interventions. Answering these questions could enable earlier, more informed treatment decisions and reduce the trial-and-error period that prolongs patient suffering.
To estimate causality across brain regions, we employ an information-theoretic method called partial mutual information from mixed embedding (PMIME) (19, 20), and examine alterations in brain connectivity and the directional influence among these regions following the initiation of antidepressant treatment. Traditional causality analysis methods such as Granger causality, transfer entropy or partial transfer entropy fail to capture nonlinear relationships in high-dimensional real-world systems. Furthermore, they either assume linear dependencies, are difficult to extrapolate to higher dimensions, are data demanding or are computationally expensive. PMIME addresses these challenges by leveraging mutual information and optimized embedding strategies, and is the only method capable of “general” causality analysis (i.e., no assumption of a model form) in high dimensions. PMIME is a model-free method that is widely applicable for identifying direct causal links between two components in multivariate systems, such as multi-channel EEG signals. This study employs pPMIME, which is a version of PMIME that operates on instantaneous phase time series of EEG data and whose performance has been extensively validated on artificial and real-world data. Phase-based causality analysis effectively reveals genuine causal relationships in weakly linked oscillatory systems, mitigating the effect of noise potentially introduced by amplitude variations. It is a well-known fact that weakly coupled oscillators can alter the phases of each other while their amplitudes remain unchanged (21). We leverage the oscillatory nature of EEG signals and employ phase-based causality analysis to achieve more reliable results. In Section 2 we present the methodology and data used in the analysis. In Section 3 we present the obtained results and then discuss them in Section 4.
2 Methods and materials
2.1 Phase-based causality analysis (pPMIME)
PMIME/pPMIME (19) is an iterative method that uses forward selection to estimate inherent lagged dependence in coupled multivariate nonlinear dynamical systems. It generates mixed embedding vectors from multiple observed time series of connected subsystems to reveal the causal relationships. The method employs mutual information/conditional mutual information (MI/CMI) / as a selection criterion and uses a statistical test as a stopping criterion of the forward selection procedure. MI measures the average shared information between random variables and , whereas CMI measures the same but conditioned by variable . Consider a coupled dynamical system consisting of subsystems. The time series data for these K variables are denoted by , where is the time index and is the index of the variables. PMIME aims to predict the -steps ahead future values of a target variable by finding a mixed embedding vector with an undetermined dimension that maximizes the MI term . The elements of belong to the superset , with the sets containing the lagged version of each variable, i.e. , , where is the maximum lag, which can be arbitrarily chosen to be appropriately large. Starting from an empty vector Ø, the embedding vector is populated sequentially using a forward selection procedure based on the maximization of MI/CMI. The CMI terms for step of the procedure are of the form , where is the embedding vector at the previous step () and is any element of . The that maximizes the CMI term is selected to be added in in order to create the new . The whole procedure is terminated using a bootstrap resampling test as a stopping criterion, i.e., testing the null hypothesis . Detailed information regarding the entire procedure can be found in (19, 22). If for a given , any element of is contained in , then we can interpret that causes . A causality measure can be derived from the following normalized CMI term, bounded between and :
In Equation 1, is split into elements that belong to and all other elements, i.e., and corresponds to the proportion of information in which can be explained by the past values of when taking into account the values of the rest of . PMIME essentially tries to detect which of the present and past values (lags) of , , i.e., can be used to optimally predict the future values , where T can be any value of .
Similarly, when applied to phase time series (), the algorithm (pPMIME in this case) tries to detect which of the present and past lags of , i.e., for all , can be used to predict the values of the phase increments . Paluš and Stefanovska demonstrated that, in the bivariate case, yields more sensitive causality tests than itself (23). Equation 1 is modified accordingly using the phase variables.
The phase time series can be extracted from the original time series using various methodologies, but herein we employ the commonly used Hilbert transform (HT) method (24, 25). Recently, it was shown that for systems with well-behaving phases or narrowly bandpass filtered data (like the EEG filtered in EEG bands), values of and are sufficient to capture causality with reduced computational time (22), so this is adopted herein. With respect to the original pPMIME, we make another simple but intuitive logical change that is critical for mitigating volume conduction effects. Instead of starting from an empty embedding vector, we start with one that includes the lag of the target variable; i.e., if we are creating embedding vectors for then the embedding vector is initialized as . This initialization effectively substantially reduces the effect of zero-lag (instantaneous) interactions common in EEG due to volume conduction. Since the CMI criterion only selects source components that provide novel information about the future beyond what is known from the target’s own current state, strong instantaneous correlations (where source and target carry identical information) are recognized as redundant and excluded. This ensures that the derived metrics reflect genuine lagged information flow.
2.2 Artificial data and method benchmark
We benchmark the pPMIME method for three cases of basic coupled nonlinear oscillators. We study coupled Rössler systems (26, 27) given by sets of the following equations (one set for each oscillator):
where , and is the total number of subsystems or nodes. The parameters , , and were kept constant for all and were chosen so that all subsystems maintain a stable periodic orbit. The elements of the binary () adjacency matrix are denoted by and define the connectivity structure of the system, with a value of for indicating a causal connection from subsystem to . The parameters determine the main frequency of oscillation of the subsystem and are set to different values for the subsystems. Finally is the coupling strength, which we consider to change globally (i.e., it is the same for all subsystems). We studied three cases of , , and coupled subsystems to validate our method on multi-coupled systems with known directions of influence to imitate real-life scenarios where there are large coupled systems. The values of were chosen as for (step of for (step of and for (step of For and we chose the elements of the adjacency matrix such that each node was unidirectionally coupled, forming a serial connection for and a “ring” of connections for . For we generated an adjacency matrix randomly such that only of the connections were present. The adjacency matrices for the three cases are displayed in Figures 1a–c.
Figure 1. The adjacency matrices for the three cases of coupled Rössler oscillators with N = 3, 5, and 19 subsystems in (a–c), respectively, and the corresponding pPMIME causality matrices (RP values) for the coupling strength where the average of all real connections was maximum in (d–f). Average RP values ± standard error (shaded area above and below the average - barely visible in some cases) as a function of coupling strength for the three networks in (g–i). RP values for real connections are plotted in solid blue lines and those for non-connections in orange dotted lines. For clarity, in panel (i), instead of the individual lines, their averages are plotted.
We studied 100 different values of coupling strength (step size of ), and for each one we generated data points from Equation 3 using the Runge-Kutta (4, 5) method (ode45 in MATLAB ver. R2021a). We chose a sampling time of 0.314 time units, and the first 1000 transient points were discarded. The data were Hilbert transformed, and the first and last 2032 time points were discarded to eliminate any distortion of the data owing to the edge effect of the transform. The remaining data were then divided into 31 segments, with each segment containing 4064 time points. pPMIME and Equation 2 was then applied to each of these segments, which resulted in () causality matrices that contained the values for all pairs of . The matrices were then averaged over all 31 segments. Each element in the matrix quantifies the causal influence of a particular node on another node, with the diagonal elements corresponding to the self-influences. For example, the component of the matrix provides the causal influence of node on node . Figures 1d–f show causality matrices for the three cases of , respectively, each for a selected coupling strength for which the average of all real connections was maximum. The specific values of epsilon for the three cases ( for , , and , respectively) are small enough to not have widespread synchronization but large enough so that the obtained values are substantial and illustrate the results sufficiently well. As expected, the causality matrices correlate well with the adjacency matrices. Figures 1g, h, respectively, show the values for the individual connections for and , as a function of coupling strength. For the case of , in Figure 1i, we show the average of the values corresponding to real connections and of those that correspond to no connections, again as a function of coupling, for reasons of visual clarity. All values plotted are averages across all segments, with error bars showing the standard error of the mean. The real connections are plotted as solid blue lines, and the rest are indicated by orange dotted lines. For all three cases and for low coupling strength values (i.e., prior to the synchronization of the subsystems), the blue lines exhibit monotonically increasing trends and exceed the values of the orange lines. This indicates that the method can accurately capture the direction of causality between the subsystems. Upon synchronization of subsystems, causality cannot be uniquely inferred, and we can see that the orange lines increase and the blue lines decrease. In addition, as the number of subsystems increases, the strength of the detected connections decreases.
2.3 EEG data
The study included 176 adult inpatients (age range 18–65 years) with a primary diagnosis of MDD, single or recurrent episode, without psychotic features. The diagnosis of MDD was established according to DSM-IV criteria and confirmed in all participants using the Mini-International Neuropsychiatric Interview (M.I.N.I., Czech version 5.0.0). To ensure at least moderate depressive symptom severity at baseline, participants were required to have a total score on the Montgomery–Åsberg Depression Rating Scale (MADRS) (28, 29) of ≥ 20 and, in the majority of cohorts, a Clinical Global Impression (CGI) score of ≥ 4. The sample predominantly comprised patients with treatment-resistant depression. All participants were hospitalized due to an unsatisfactory response to previous outpatient treatment and fulfilled at least Stage I criteria for treatment-resistant depression according to Thase and Rush (i.e. nonresponse to ≥ 1 adequate antidepressant trial in the current episode). The adequacy of prior antidepressant treatment was verified using the Antidepressant Treatment History Form (ATHF); an ATHF score of ≥ 3 was required to confirm at least 4 weeks of treatment at an adequate dose. A psychotropic medication washout period (median duration 4 days) was implemented for all participants prior to initiation of the new antidepressant treatment. Stringent exclusion criteria were applied to reduce potential confounding, particularly related to concurrent substance use and additional psychopathology. Participants were excluded if they had a current or past history of alcohol or illicit drug abuse or dependence, or any additional Axis I or Axis II psychiatric comorbidity. To minimize pharmacological confounding, individuals with prior exposure to fluoxetine were excluded because of its prolonged biological half-life, and patients who had received electroconvulsive therapy (ECT) within the 3 months preceding study enrollment were not included, given the substantial impact of ECT on EEG measures. More details regarding the sample, recruitment criteria and treatment can be found in previous clinical analysis reports (30–36). Details regarding sex, age, MADRS score, illness duration and response to treatments are shown in Table 1. The patients were subjected to either pharmacological or neurostimulation treatments for four weeks. Individuals who showed a ≥ 50% reduction in the MADRS score at the end of the treatment were considered as responders to the treatment. Pharmacological treatment included the following classes of antidepressants: serotonin norepinephrine reuptake inhibitors (SNRI), selective serotonin reuptake inhibitors (SSRI), norepinephrine-dopamine reuptake inhibitors (NDRI), noradrenergic and specific serotonergic antidepressants (NaSSA), and tricyclic antidepressants (TCA). Neurostimulation treatment included low-frequency repetitive transcranial magnetic stimulation (LF-rTMS) at 1 Hz over the right dorsolateral prefrontal cortex (DLPFC) and transcranial direct current stimulation (tDCS) consisted of anodal stimulation over the left DLPFC and cathodal placement over the right supraorbital region, delivered at 2 mA. There were two EEG recording sessions for each patient, one before treatment initiation (visit 1) and a second after one week of treatment (visit 2). Data were recorded with the participants lying in a semirecumbent position with their eyes closed in a maximally alert state. Approximately 10 minutes of data were sampled at 250 Hz or 1000 Hz from 19 electrode channels (Fp1, Fp2, F3, F4, C3, C4, P3, P4, O1, O2, F7, F8, T3, T4, T5, T6, Fz, Cz, and Pz) placed according to the International 10–20 system (37).
Table 1. Patient demographics and information. Respondents (R), nonrespondents (NR), number of patients (n).
This study was conducted in accordance with the principles outlined in the Declaration of Helsinki and adhered to applicable guidelines for Good Clinical Practice. Ethical approval was obtained from the Ethics Committee of the Prague Psychiatric Centre/National Institute of Mental Health for all study protocols contributing data to this research, with the current retrospective analysis falling under the existing institutional ethics approval for the EEG registry. While a portion of the dataset was derived from clinical trials registered in the EUDRACT database (EUDRACT Nos. 2005-000826–22 and 2015-001639-19, registered via www.clinicaltrialsregister.eu), additional data were obtained from ethically approved studies that were not formally registered in public trial registries. In all cases, adult participants were thoroughly informed about the study procedures and objectives, and they provided written informed consent prior to participation that explicitly authorized the secondary use of their clinical and neurophysiological data for future research purposes.
2.3.1 Preprocessing and causality estimation
The data have been used in previous studies and were preprocessed as described in Ref. (38). In short, the preprocessing included removing extra channels, resampling records to a common 250 Hz sampling frequency, discarding the first and last 30 seconds of each record, re-referencing to the average reference, bandpass filtering the data between 1 and 40 Hz, and removal of data segments with high-power artefacts. The overall aim is to increase the signal-to-noise ratio and make the data more interpretable. More details on the specifics of the preprocessing steps and their individual effects are provided in (38). We further filtered the data into six different frequency bands to focus on causality related to the specific brain activity in each of these bands. The bands were divided as follows: δ : (1.5Hz - 3.5Hz), θ : (4Hz - 8Hz), α : (8Hz - 12Hz), β1: (12.5Hz - 17.5Hz), β2: (18Hz -25.5Hz) and γ : (26Hz - 40Hz). We implement pPMIME on data segments of 8s duration (2000 time points), as the estimation of CMI has large data requirements. Thus, for each data segment, we estimated six 19 × 19 matrices, one for each band. For each subject, a varying number of segments were available, ranging from 14 to 92. The findings of the γ band are not used herein because of RP estimation problems due to improper phase extraction at high frequencies with HT (attributed to the extensive range of the γ band).
2.4 Flow metrics
The pPMIME method generates causality matrices of dimensions () for the EEG channels. Each element in the matrix quantifies the causal influence of one channel on another. As detailed in Section 2.1, each entry is a normalized conditional mutual information term bounded between and that expresses the proportion of phase increment information in channel that is uniquely explained by the past of channel , after accounting for the past of and of all other channels. Because all entries share this common, dimensionless scale, averages of over anatomically defined sets of connections can be interpreted as expected directional information flow within or between regions. The causality matrices are structured such that the initial eight nodes correspond to channels in the left hemisphere (Fp1, F3, C3, P3, O1, F7, T3, and T5), the subsequent eight nodes correspond to channels in the right hemisphere (Fp2, F4, C4, P4, O2, F8, T4, and T6), and the final three nodes correspond to channels located along the midline of the head (Fz, Cz, and Pz). We define FLOW metrics based on the average of specific parts from a standard causality matrix, as illustrated in Figure 2. This allows for the quantification of information flow between brain regions derived from the causal relationships identified in the EEG channel data. We define nine FLOW metrics: The Total Flow () metric is the aggregate information flow across the channels, calculated by averaging all elements of the matrix. We define metrics for the left hemisphere, , , , and , which respectively represent the information flow from the left hemisphere to itself, to the right hemisphere, to all the regions of the brain, and the information flow from all regions of the brain to it. Similarly, we define metrics for the right hemisphere, which we denote as , , , and . The averages are calculated for the regions (excluding the diagonal elements corresponding to the self influences) indicated by the red boxes labeled LL, LR, out L, in L, RR, RL, out R, and in R in Figure 2. Additionally, we study the inflow and outflow for each channel separately, which are defined as , . So in total, we have metrics. The first are the FLOW metrics, which are global and quantify the information flow of the brain as a whole or per brain hemisphere. The other are the INFLOW and OUTFLOW per channel, which are local metrics and characterize small brain regions that correspond to the positions of the electrodes.
Figure 2. Causality matrix of a subject, along with the regions that are used to define FLOW metrics for hemisphere-to-hemisphere interaction (a), per-hemisphere inflow (b) and per-hemisphere outflow (c).
2.5 Statistical analysis
After obtaining the causality matrices for each subject over all time segments, visits, and frequency bands, we compute the metrics defined in Section 2.4 for every time segment of each participant. Subsequently, we calculate the mean of the metrics across all the time segments. Thus, for each band and each of the 47 metrics, we have two sets of 176 values (one set for each visit). Since there are multiple factors (response, treatment, and visit) and one of them is a repeated factor (the EEG recording is performed twice on the same patient), we analyzed the data with repeated measures analysis of variance (ANOVA) (39). The repeated measures ANOVA design includes two independent between-subject factors (response and treatment) and one repeated (dependent) within-subject factor (the visit), each with two levels. The response factor has two levels: respondents and nonrespondents, while the levels for the treatment factor are pharmacological and neurostimulation. For the visit, the two levels are visit 1 and visit 2. The setup of the repeated measures ANOVA model includes the individual factors along with all their interactions (3 pairwise and 1 with all three). The analysis for each EEG frequency band is performed separately. Because of the multiple factors and the “rigidity” of the ANOVA design, we do not actually derive statistical conclusions from these p-values; rather, we utilize these results as preliminary information to guide subsequent analysis. Informed by the outcomes derived from the repeated measures ANOVA study, further post-hoc analysis for the comparison of individual patient subgroups is performed with Student’s t-tests (independent or paired), and the effect size of any change/difference is estimated by Cohen’s (40). Non-parametric tests (41) gave similar results to the t-tests, with p-values generally slightly higher with certain differences in statistical significance for some metrics, but no substantial change in the overall conclusions. The ultimate goal of this analysis is to reveal which, if any, of these metrics has the potential to serve as biomarkers.
Our study is exploratory and hypothesis-generating in nature. Given that pPMIME captures non-linear, directional dependencies distinct from the linear correlations found in most prior literature, we opted against restricting our analysis to pre-defined regions of interest. There are no specific hypotheses tested where there is an expected difference based on some prior knowledge. We are conducting a “blind” investigation on which metrics show a difference, and to do this, we employ a comprehensive set of metrics on all possible EEG bands, expecting that most of them will probably not provide statistically significant results. As a matter of fact, we expect our significant results to be constrained to one or, at most, two frequency bands. Furthermore, certain metrics are, by construction, correlated. Because of the high correlation and large number of metrics, global corrections for multiple comparisons are very likely to lead to most p-values being “over-corrected” and rendered insignificant. Since the study is exploratory, over-correcting the p-values is antithetical to the main objective, and in accordance to recent recommendations (42–44), herein we treat our analysis as three separate experiments based on the three metric types (i.e., FLOW, INFLOW, OUTFLOW). Each experiment has its corresponding experiment-wise hypothesis : There is no change in the values of any metric of the specific type, along with a “family” of statistical hypothesis tests for the individual metrics of each type. Given that the hypotheses tests inside each family are interchangeable for the corresponding experiment-wise hypothesis (but not interchangeable for the other two experiments) we employ the Benjamini–Yekutieli FDR controlling procedure (45) to obtain adjusted p-values per metric family and EEG band. All p-values in the following section are FDR adjusted and assessed at the α = 0.05 significance level. The original, unadjusted p-values from the t-tests and the p-values from the non-parametric permutation tests are provided in the Supplementary Materials, Tables 4–7. Controlling for the effects of age and sex using linear models instead of Student’s t-tests did not substantially change our statistical results, neither quantitatively nor qualitatively.
3 Results
3.1 Results from repeated measures ANOVA
The results of the repeated measures ANOVA are graphically presented in Figure 3 and summarized in Table 2. The figure displays a colormap highlighting the p-values for the factors and interactions with p< 0.05 for each metric. The colormap is organized vertically in seven blocks separated by red horizontal lines. Each block corresponds to a factor or interaction of the factors, and within the block are the five frequency bands. The horizontal axis displays the metrics. The three families of metrics are separated by black vertical lines. The p-values ≥ 0.05 are all set to yellow, while the colder colors correspond to lower p-values. The text around the colormap denotes factors, bands, and metrics. For example, for the TF metric, the treatment factor is significant in the band () and is presented in the first column, the first value of the second block. For the same metric, the only other p-values that are below are the response-treatment and treatment-visit interactions in the (, , respectively). Table 2 summarizes the repeated measures ANOVA results by presenting the number of metrics/bands that provide a significant p-value for each factor/interaction.
Figure 3. The colormap displays p-values from the repeated measures ANOVA. All p-values above 0.05 are highlighted in yellow. R, Response; T, Treatment; V, Visit. Each row represents the results of a particular band. There are seven blocks corresponding to each factor, separated by a horizontal red line. The metrics corresponding to FLOW, INFLOW and OUTFLOW per channel are separated by a black vertical line.
Table 2. Total number of metrics/band combinations that provide significant p-values for different factors and their interactions.
In Figure 3, we observe that the majority of p-values that are less than 0.05 are mostly for the and the bands, and that INFLOW metrics show more cases of statistical significance than the OUTFLOW metrics. The summarized repeated measures ANOVA results from Table 2 show that the three-way interaction factor is not significant for the majority of the metrics and nearly all bands. Only the INFLOW exhibits significant p-values for this interaction for just five metrics spread across all bands. The pairwise interactions between the treatment factor with response and the treatment factor with visit yield the highest number of significant p-values (16 and 26, respectively). On its own, the treatment factor yields 36 statistically significant metric/band combinations. These results indicate that the treatment has the most significant effect on the values of the metrics. Thus, the results suggest analyzing the data from the two treatments independently. The single factors of response and visit give significant values for 12 and 17 cases, but their interaction gives 7, and most of them have relatively high p-values (5 out of the 7 are around 0.04 and the 2 around 0.015). This indicates that even though there is a change in the metric values between visits and there exists a difference between those that responded to treatment and those that did not, these two factors are relatively independent. That is, changes in metric values between visits 1 and 2 are not much related to treatment outcome (response). Combining this with the result for the treatment and visit interaction, we can deduce that treatment may be a strong confounding factor when studying the relationship between response to treatment and metric values. We note that since visit 2 was one week after visit 1, but the respondent/nonrespondent assessment occurred at week 4, it is possible that the week 2 metrics (i.e., brain connectivity) may not have changed enough to “capture” the treatment outcome.
Based on the results of the repeated measures ANOVA, we chose to examine the subjects who received pharmacological treatment and neurostimulation independently. Additionally, we perform two distinct analyses. First, we compare the metrics of the two visits. Here we are not concerned with the treatment outcome but with what effect, if any, the two treatments have on the metrics and consequently on the causal relation in the brain. The second analysis compares the metrics between respondents and nonrespondents to treatment. In this case we compare the metrics of visit 1 for the two response groups. We exclude the visit 2 data for two reasons. First, according to the repeated measures ANOVA results, there is no significant interaction between the response and visit interactions. Second, we cannot be sure about the therapeutic state of the patients at visit 2. It is possible that for some subjects the treatment has produced a change in brain connectivity related to therapy response, while in others it may not have. As a matter of fact, analysis of the visit 2 data alone produced results similar to that of visit 1, but a bit worse, further strengthening the idea that the week 2 brain connectivity changes may not be indicative of the outcome for all subjects.
In the rest of the manuscript we only present results for the FLOW and INFLOW metrics. The ANOVA results for the OUTFLOW metrics show that the statistically significant results for the cases of interest are either much fewer than the INFLOW metrics (2 versus 16 for the treatment/visit interaction) or spread across bands (e.g., compare Figure 3 results for INFLOW and OUTFLOW for the treatment factor). For OUTFLOW, a minimum of one and a maximum of four channels were sporadically significant in a particular band, whereas all other channels had high p-values. All numerical values of the repeated measures ANOVA p-values are presented in the Supplementary Materials (Tables 1–3) for the sake of completeness.
3.1.1 Effect of treatment on brain causality
This section presents the results of the change in metrics between visits 1 and 2 for the two therapy groups independently. We perform Student’s t-test (paired) to compare the metric values from two visits. The p-values for all FLOW metrics and all five bands are listed in Table 3. The and the δ bands are primarily significant for the FLOW metrics in the pharmacological and neurostimulation groups, respectively. In the first group, the p-values in the band indicate a substantial difference between visits across all FLOW metrics. In the second treatment group TF, LL, LR, out_L, in_L, and in_R are significantly different in the δ band. Figure 4 illustrates the variation in all the FLOW metrics between the two visits for the pharmacological group in the band, and Figure 5 presents the same for the neurostimulation group in the . No metric achieved statistical significance after FDR correction in the other three frequency bands; therefore, we omit the presentation of these results graphically (figures corresponding to all metrics and all bands are presented in the Supplementary Materials, Figures 1–5). In both figures, the y-axis of the panels displays the metric values averaged over all subjects along with the standard errors of the mean.
Table 3. P-values for the comparison of the FLOW metrics between visits 1 and 2 for the two treatment groups. p-values< 0.05 are in bold font.
Figure 4. Change in metric values between visit 1 (v1) and visit 2 (v2) in the β2 band for the pharmacological treatment. The y-axis of all panels (a–i) displays the metric values averaged over all subjects along with the standard errors.
Figure 5. Descriptions of all panels (a–i) are similar to Figure 4, but for the neurostimulation treatments and the δ band. The dashed lines in panels (e–g) indicate that there was no statistically significant difference in these cases.
For the pharmacological group, in the band, all the FLOW metrics (Figures 4a–i) show a statistically significant increase from visit 1 to visit 2 with p-values ranging between < 0.001 and 0.049. The smallest p-values are observed for metrics TF, in_L, and in_R, all less than 0.001. The effect size for these three metrics based on Cohen’s d is moderate, or close to moderate (d = 0.3885, 0.3166, 0.4138, respectively). For the remaining metrics, the effect size is small (d in the range of 0.1900 − 0.2429).
In the case of the neurostimulation group and the band, most metrics, but not all, show significant changes. The FLOW metrics , , , , , and exhibit a statistically significant increase between the two visits with very low p-values, at a maximum of (Figures 5a–d, h, i). We note that the smallest p-values () correspond to flows originating from the left hemisphere ( and ), for which the effect size is again moderate (, respectively) but larger than the effect sizes observed in the pharmacological group. Metrics , , , and also exhibit a moderate effect size change ( in the range of ). Contrary to what we observe for the left hemisphere, the metrics corresponding to outflow from the right hemisphere (, and ) do not exhibit any statistically significant changes (Figures 5e–g), with very small effect size ( in the range of ).
Figures 6 and 7 illustrate the results of the change in inflow per channel between the two visits. The results are presented graphically in brain topographical maps and are given numerically in the Supplementary Materials, Table 5. In both figures, the first row of panels shows the p-values for the 19 channels in all five bands. The channels that show statistically significant changes in inflow from visit 1 to visit 2 are highlighted in green text. The results for the pharmacological group are presented in Figure 6. The second and third rows show topographical maps of the actual values of information flow per channel averaged over all subjects in the five frequency bands for visits 1 and 2, respectively. The fourth row shows the difference in the inflow as visit 2 minus visit 1. The and the bands are the only ones in which the channels had a statistically significant change in inflow. The overall inflow per channel increases from visit 1 to visit 2 in both bands. The channels that are significant for both and bands correspond to the right frontal (F4), central (Cz, Pz), left parietal (P3), posterior temporal (T5, T6), and occipital (O1, O2) regions. Fp2 and P4 are significant only in the band but still have p-values quite low for the band ( both, marginally above ). Comparing the topographic maps of the two visits for both the bands, it can be observed that the band shows a larger increase in the inflow compared to the band. The effect size of the inflow metrics for the significant channels is around moderate ( in the range of ) for the band and small for the band ( in the range of ). With respect to localization and the change in inflow, for the we observe that the statistically significant changes are quite widespread, excluding mainly left prefrontal and mid-temporal regions of the brain. The largest changes (smallest p-values) are observed for the midline frontal (Fz, ), parietal (P3, P4, ), occipital (O1, O2, ), and right posterior temporal (T6, ) regions. Comparing the left and right hemisphere p-values for the channels with statistically significant change (F3, C3, P3, O1, F7, T5 vs F4, C4, P4, O2, F8, T6), we see that the average p-value in the left is 0.012, while in the right it is 0.002 (almost an order of magnitude difference), indicating that there is more significant change (increase) of inflow in the right hemisphere. Average effect sizes of the significant channels in the left and right hemisphere are 0.3076 and 0.374, respectively.
Figure 6. Brain topographical plots for the comparison of inflow per channel between the two visits for the pharmacological group. The first row of panels is the p-values for the comparison. The 2nd and 3rd rows present the average values (across subjects) of the inflow for visit 1 (v1) and visit 2 (v2), while the 4th row presents the difference between the visits. Columns correspond to frequency bands, as indicated by the text above the top panels. Channels with p-values< 0.05 are indicated in green-colored text in the first row.
Figure 7. Similar to Figure 6, but for the neurostimulation group. Rows of the average inflow per channel for the two visits (2nd and 3rd rows in Figure 6) are omitted (given in Supplementary Materials, Figure 7).
Figure 7 presents the results for the neurostimulation group. In this case, the p-values are displayed in the first row of the panels and the difference in the average INLFOW per channel in the second row of panels. The δ band is the only band where a significant change in inflow is observed, with p-values of significant channels ranging between 0.013 and 0.040 (Cohen’s d in the range 0.2905 − 0.5348). The positive values of the difference in inflow for the δ band show that there is a significant increase from visit 1 to visit 2. For the δ band, the changes are relatively widespread, including the frontal (Fp1, F3, F4), central (Fz, Cz, Pz), parietal (P3, P4), occipital(O2), and temporal (T3, T5, T6) regions, but the most significant changes, i.e., the lowest p-values, are observed in the central and posterior temporal regions (Cz, T5, and T6 with p = 0.013 for all three and d = 0.5348, 0.5048, and 0.5007, respectively). Similar to pharmacological treatment, we observed that the average increase in inflow in the significant channels is greater in the right hemisphere (average d = 0.4203) than the left (average d = 0.3769).
3.1.2 Baseline brain causality difference between response groups
This section presents the results of the metric analysis with respect to the response to treatment, i.e., the comparison of respondents with nonrespondents. We perform Student’s t-test (independent) to compare the metric values of visit 1 for the two response groups. The p-values for all FLOW metrics and for all five bands are listed in Table 4. Any significant difference in the metrics between the response groups will inform us about a difference in brain connectivity patterns of individuals who are going to potentially respond or not to the treatment. From Table 4, we observe that only the band in the pharmacological group shows significant differences for the FLOW metrics , , , , , and with p-values for all six metrics (the equality in the p-values is due to the FDR adjustment). For the neurostimulation group, there is no metric that achieves statistical significance. Prior to p-value adjustment there were three metrics in the band that showed significant change (, , and with , , and , respectively), but they did not survive the FDR procedure. The difference of the significant metric values for the pharmacological group in the band is depicted in Figure 8. The y-axis of the figure displays the average over all subjects of the metrics for each response group along with their standard errors. The bar plots indicate that the nonrespondents have higher values for all these metrics than the respondents. The effect size for all these metrics is again at or near the moderate level ( in the range of ). While statistically significant, these effect sizes indicate a degree of distributional overlap between responders and nonresponders, suggesting that these metrics should be interpreted as group-level mechanistic differences rather than standalone individual diagnostic tools. The bar plots corresponding to all metrics and bands are presented in the Supplementary Materials (Figures 8–12).
Table 4. P-values for the comparison of the FLOW metrics at visit 1 between respondents and nonrespondents for the two treatment groups. Significant p-values less than 0.05 are in bold font.
Figure 8. Bar plots of the significantly different FLOW metrics between respondents (R) and nonrespondents (NR) to pharmacological treatment at the time of visit 1 in the α band.
Figure 9 depicts the results of the comparison of inflow per channel between respondents and nonrespondents at visit 1 in all five frequency bands. The topographic plots depict the p-values for the comparisons, along with the difference in inflow averaged over all subjects. The first two rows correspond to the pharmacological group, and the last two correspond to the neurostimulation group. The band is the most significant band for the pharmacological group. The most significant differences between the two response groups can be localized to the frontal (Fp1, F4), left anterior temporal (F7), right parietal (P4), right occipital (O2), and central (Pz) regions (all six having p-values and Cohen’s to , indicating moderate effect size). The channels Fp2, Fz, F8, C3, C4, and P3 had p-values , which are marginally above the significance threshold. With respect to the nature of the difference, for all the significant channels, the nonrespondents have a higher inflow per channel than the respondents. The band also had two channels (Fp1 and T5) with before FDR adjustment, but they lost significance after adjustment, and the other three bands had no significant channels.
Figure 9. Brain topographical plots for the comparison of inflow per channel between respondents (R) and nonrespondents (NR). The first row of panels is the p-values for the pharmacological (Ph) group, and the second row is the difference in inflow between R and NR averaged over all subjects. The 3rd and 4th rows are the same as the first two but for the neurostimulation (St) group. Columns correspond to frequency bands, as indicated by the text above the top panels. Channels with p-values< 0.05 are indicated in green-colored text in the 1st and 3rd rows.
For the comparison of the response groups in the case of neurostimulation treatment, the results are not as positive. No bands show any statistically significant difference between the response groups for any channel. Prior to the FDR procedure, channels Fp1, T3, T5, C4, and O2 in the band had , but after adjustment, all of them lost significance with the adjusted p-values of C4 to be and the rest higher. The band also had three channels (F7, T3, and F4), with p-values before adjustment, which lost significance afterwards. In both bands, contrary to the pharmacological group, the respondents have a (non-significantly) higher inflow per channel compared to the nonrespondents.
4 Discussion
In this study, we conducted an information theory-based causality analysis on EEG data of patients diagnosed with MDD who received pharmacological or neurostimulation treatments. We introduced a slightly adjusted version of a previously developed method (pPMIME) and first tested it on artificial data, where it performed very well in capturing causal relationships for weak coupling. We then used pPMIME to estimate the pairwise strength of the causal relationships between brain locations. To study information flow in the brain, we defined sets of global (FLOW) and local (INFLOW and OUTFLOW per channel) causality metrics. We used repeated measures ANOVA to determine which factors (visit, treatment, response, and all their possible interactions) were the most important. The ANOVA results indicated overall differences between the two treatment groups and that there was no interaction between the response and visit factors. Therefore, we decided to analyze separately the data from the two treatments. We focused on two independent studies: 1) analyzing the impact of each treatment modality on brain causality between visits and 2) comparing baseline/visit 1 metrics for the two response groups.
The FLOW metrics performed well in differentiating the two visits as well as the two response groups, the latter only for the pharmacological treatment. With respect to the local metrics, the OUTFLOW per channel performed much worse than INFLOW. OUTFLOW provided fewer in number and magnitude significant p-values that did not provide any strong localization information. On the other hand, INFLOW provided insight about brain region-specific information flow and indicated brain regions whose “causal behavior” is altered by treatment and may potentially serve as indicators of response to treatment.
A significant consideration in our analysis was the heterogeneity of treatment types within the pharmacological and neurostimulation groups. The pharmacological group comprised five subgroups (SNRI, SSRI, NDRI, TCA, and NaSSA), while neurostimulation included rTMS and tDCS. We performed a sensitivity analysis to test for interaction effects between treatment subgroups and the outcome metrics. Repeated measures ANOVA on the two groups separately followed by FDR adjustment over each family and subsequent post-hoc rank based comparison tests revealed that there was no statistically significant difference between the subgroups with respect to causality. While we acknowledge that the absence of statistical significance does not constitute definitive proof of physiological equivalence—given the potential for Type II errors in smaller subgroups—the current analysis did not detect a substantial effect of the heterogeneity. This lack of detectable divergence between the modalities statistically justifies their pooling into a single cohort, consistent with the view that these interventions share convergent downstream effects on neural circuitry. Theoretically, all the different subgroups in pharmacological treatment are known to increase synaptic neurotransmitter availability, such as serotonin, norepinephrine, and dopamine by different mechanisms to prevent the reuptake of the corresponding neurotransmitter into the presynaptic neuron (46). All three neurotransmitters work together to address neurochemical imbalances or neural circuit dysfunction to improve mood, reduce anxiety, and restore emotional equilibrium, thus converging on common downstream circuit-level effects. Similarly, both rTMS and tDCS aim to restore balance in prefrontal–limbic circuitry, albeit through different neurophysiological mechanisms—modulating neurotransmitter release, metabolism, or gene expression but achieving convergent network-level outcomes (47, 48). rTMS (1 Hz right DLPFC): provides inhibitory modulation of overactive right prefrontal regions, indirectly facilitating left prefrontal activity and rebalancing hemispheric asymmetry associated with depression. tDCS (anodal left DLPFC): enhances excitability and connectivity of hypoactive left prefrontal cortex, also aiming to restore network balance. Thus, the first part of this study focused on the changes of information flow metrics due to the overall effect of the pharmacological versus neurostimulation treatment irrespective of subgroups variations.
After one week of pharmacological treatment, we observed a relatively global increase in information flow in band without strong hemispheric bias. This finding complements prior findings of elevated power and phase synchronization post-treatment (49, 50), which enhance signal-to-noise ratio and aligns windows of excitability across distant regions. Together, these changes create stronger and more precisely timed carrier signals that could indicate the observed increase in information flow—quantified by causality analyses. One plausible neurobiological mechanism likely involves restoration of neurotransmitter dynamics in circuits affected by depression. Certain brain regions may exhibit insufficient neurotransmitter activation in MDD, leading to impaired synaptic communication. Pharmacological treatments, such as SSRIs, limit the action of corresponding serotonin reuptake transporters, thereby prolonging the presence of neurotransmitters like serotonin at the synapse. This boosts neuronal signaling and facilitates symptom relief. The resulting enhancement in circuit-level communication may manifest as increased information flow, as observed from visit 1 to 2 in the band for the metrics corresponding to all brain regions. Regionally, the largest inflow increases occurred in the band, with maximal effects at the midline frontal, parietal, and occipital sites. Since MDD is known to disrupt large-scale cortical and subcortical networks spanning multiple lobes of the brain (51, 52), these results suggest treatment exerts its strongest influence on these interconnected systems. Notably, at visit 2, inflow to local sites in the band increased more in right hemisphere regions compared to the left. This pattern supports the view that the right hemisphere is affected more in MDD (53) and suggests that treatment may enhance left hemisphere’s capacity to send regulatory signals to the right.
After one week of neurostimulation treatment, the band distinguished the two visits by showing a statistically significant increase in global information flow from the left hemisphere to the rest of the brain. This pattern aligns with prior findings that focused on frontal channels and reported stimulation-induced changes in band connectivity (54). Regionally, the effects were widespread, with the largest differences observed in the central and posterior temporal areas, alongside significant alterations in the frontal sites—including the left frontopolar, bilateral dorsolateral prefrontal, and midline frontal sites. Physiologically, rTMS and tDCS modulate synaptic plasticity and neural excitability, thereby reshaping functional connectivity by strengthening beneficial neural pathways and weakening maladaptive ones (47, 48). Both approaches excite the hypoactive left prefrontal cortex, which is reflected in the marked increase in left-to-whole-brain information flow in the δ band from visit 1 to 2. Notably, FLOW metrics for both hemispheric outflow showed opposite trends from visit 1 to 2 (Supplementary Materials, Figures 1–5) across most frequency bands, except for , where right-hemisphere increases were present but substantially smaller in effect size compared to the left. This asymmetry aligns with the therapeutic mechanisms of stimulation, which act by selectively enhancing or suppressing neural connections to restore balanced communication patterns, reflected as an increase and decrease in information flow. Unlike the pharmacological group, where nearly all metrics across bands trended upward, neurostimulation produced a more differentiated profile of increases and decreases across regions and frequencies.
The frequency-specific nature of our findings ( for pharmacological, for neurostimulation) suggests that different treatments engage distinct neural oscillatory mechanisms. oscillations are associated with top-down cortical control and GABAergic inhibition (55, 56), while delta oscillations reflect large-scale cortical-subcortical interactions and homeostatic sleep processes (57, 58). This frequency dissociation may explain why some patients respond better to one treatment modality over another, and supports the development of frequency-targeted therapeutic interventions (59, 60).
The baseline analysis at visit 1 revealed that in the α band, patients unlikely to respond to traditional antidepressants exhibited greater total information flow—especially right-to-left outflow and bilateral inflow—than potential responders. At the local level, the significant difference were widespread, the frontal, right parietal, left anterior temporal and right occipital regions showed the largest difference, where the nonresponders showed higher inflow. Our agnostic approach revealed patterns that align with, yet extend, established theories. For example, these findings are consistent with prior reports of right hemisphere hyperactivity (53) and elevated α band activity in depression, often spanning large portions of the cortex (2, 61). This confirms that our data-driven causality analysis captures known pathological asymmetries while providing novel detail regarding the directionality of these disparate flows. Physiologically, the elevated information flow corresponding to the right hemisphere before treatment might indicate that the brain is attempting to compensate by increasing overall connectivity and information transfer. However, this hyperconnectivity appears to be maladaptive rather than beneficial which causes the brain to work harder but with reduced efficiency in supporting functions such as processing negative emotions, engaging in repetitive thought patterns, and managing automatic bodily functions that the right hemisphere struggles with.
Across individual subjects, all six baseline significant metrics correlated negatively with proportional symptom changes (i.e. percentage change in depression severity score from baseline : with Pearson correlation whose percentile bootstrap confidence intervals (CIs) (bootstrap resamples: ) excluded zero, two-sided permutation p-values were small (, permutations), and FDR adjusted p-values across metrics ranged between , while left-to-right flow metrics showed near-zero effects and non-significant p values. The narrow, bootstrap-stable CIs and low permutation p-values (Supplementary Materials, Table 8) suggest that these associations are robust to sampling variability and therefore plausibly reproducible in cohorts with similar characteristics, although independent validation is still required. These results are consistent with group-level findings, where nonresponders exhibited higher baseline connectivity than responders. Together these results suggest that elevated band flow particularly right dominant global measures may indicate a less plastic, maladaptive network configuration that limits pharmacological responsiveness. However, the observed correlations are small-to-moderate in magnitude likely too small to serve as standalone clinical decision tools; therefore as a future work we plan to build a prospective predictive modelling that combines these metrics with other clinical and neurophysiological features, evaluated with proper cross−validation and external validation is required to assess clinical utility and generalizability.
Baseline comparisons between responders and nonresponders to neurostimulation treatment lacked strong statistical significance due to smaller sample size. However, several consistent patterns emerged: responders exhibited higher band total flow and bilateral inflow, higher local inflow across channels, and moderate-to-large effect sizes (Cohen’s , , and , respectively) for the three global metrics and locally ( in the range of 0.643 - 0.910 for Fp1, T3, T5, C4, O2). These effect sizes exceed those in the pharmacological group and mirror prior reports of elevated baseline band activity in stimulation responders (54), suggesting these differences would likely reach significance in larger samples.
The findings of our study complement the existing results in the literature, which have established that several brain regions are affected in MDD, which is reflected in different frequency bands (2). Our frequency-specific findings align with recent MEG evidence showing that beta-band connectivity within inhibitory control networks is particularly relevant for treatment response prediction in MDD (62). The study reported that non-responders exhibited decreased β band connectivity in a left-lateralized frontoparietal network centered on superior parietal gyrus and orbitofrontal cortex—regions we also identified as showing altered causality patterns. This convergence across different methodological approaches (MEG coherence vs. EEG phase-based causality) strengthens the evidence that frequency-specific network disruptions are fundamental to treatment resistance, arguing for multiband integration in future analyses.
The larger effect sizes for right hemisphere inflow metrics between visits for both treatments suggest that therapies restore hyperactive right-sided circuits by channeling healthier signals from the rest of the brain. The hemispheric specificity of our findings gains additional support from recent findings (62), that while all MDD patients showed right-lateralized network disruptions (potentially reflecting general disease pathology), only non-responders showed additional left-lateralized frontoparietal hypoconnectivity. This pattern of bilateral dysfunction in non-responders versus primarily right-sided alterations in responders suggests that preserved left hemisphere compensatory mechanisms may be crucial for treatment response. The convergent evidence across studies suggests a hierarchical model of network dysfunction in treatment resistance. Primary right-hemisphere disruptions (observed in all MDD patients) may represent core pathophysiology, while secondary left-hemisphere decompensation (seen specifically in non-responders) indicates a failure of compensatory mechanisms (63, 64). While we did not find statistically significant differences in left hemisphere outflow between responders and non-responders, our finding that non-responders exhibited significantly greater total flow, right-to-left flow, and bilateral inflow is consistent with this pattern of dysregulated interhemispheric communication. This bilateral breakdown of top-down control (65, 66) may explain why standard antidepressants, which primarily target monoaminergic systems, fail to restore network function in these patients (67, 68).
These findings have important clinical implications for personalized medicine in MDD. The distinct α band baseline signatures in pharmacological nonresponders suggest that EEG-based causality metrics could be incorporated into clinical decision-making algorithms. While single-metric prediction is limited by moderate effect sizes, patients showing high baseline α band information flow, particularly from the right hemisphere, display a physiological profile that suggests they might be better candidates for neurostimulation or combined treatment approaches rather than pharmacotherapy alone. This could reduce the trial-and-error period often associated with antidepressant selection, potentially improving patient outcomes and reducing healthcare costs. An important consideration is the temporal dynamics of treatment response. While clinical response was assessed at 4–6 weeks, our brain connectivity changes were measured at one week. This early timepoint may capture initial neuroplastic changes that precede clinical improvement. The observation that visit 2 metrics did not improve prediction over visit 1 metrics suggests that baseline characteristics may be more informative than early treatment-induced changes for predicting ultimate treatment response. Future studies should include multiple intermediate timepoints to map the trajectory of causality changes throughout treatment.
Study limitations include the absence of explicit statistical tests for hemispheric asymmetry metrics and limited statistical power for baseline responder/nonresponder comparisons in the neurostimulation cohort due to small sample size. Future studies should use larger cohorts, include intermediate timepoints to map causality trajectories, and develop prospective predictive models that combine these metrics with clinical and other neurophysiological features.
This study demonstrates that pharmacological and neurostimulation treatments affect brain information flow differently, with frequency-specific signatures ( for pharmacological, for neurostimulation). We observed significant imbalances in brain connection patterns between responders and nonresponders, with distinct profiles for each treatment modality. The α band baseline differences in pharmacological treatment indicate pretreatment EEG profiles carry predictive value for treatment response. These information flow metrics can serve as valuable mechanistic indicators augmenting personalized therapeutic approaches in MDD, though further validation in larger, independent cohorts is essential for clinical translation.
Data availability statement
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.
Ethics statement
The studies involving humans were approved by Ethics Committee of the Prague Psychiatric Centre/National Institute of Mental Health. The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.
Author contributions
MBh: Writing – original draft, Writing – review & editing. IV: Writing – original draft, Writing – review & editing. AK: Writing – review & editing. JH: Writing – review & editing. MBr: Writing – review & editing. MBa: Writing – review & editing. MP: Writing – review & editing.
Funding
The author(s) declared that financial support was received for this work and/or its publication. This study is supported by the Czech Science Foundation, Project No. GF21-14727K, and the Czech Academy of Sciences, Praemium Academiae awarded to MP. It was also supported by ERDF-Project Brain dynamics, No. CZ.02.01.01/00/22_008/0004643 and the Czech Science Foundation project No. 23-07074S awarded to J. Hlinka and Charles University Program Cooperatio 38:Neurosciences awarded to MBr.
Conflict of interest
The author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
The authors IV, MBr declared that they were an editorial board member of Frontiers, at the time of submission. This had no impact on the peer review process and the final decision.
Generative AI statement
The author(s) declared that generative AI was not used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpsyt.2025.1718216/full#supplementary-material
References
1. Vos T, Lim SS, Abbafati C, Abbas KM, Abbasi M, Abbasifard M, et al. Global burden of 369 diseases and injuries in 204 countries and territories, 1990-2019: a systematic analysis for the global burden of disease study 2019. Lancet. (2020) 396:1204–22. doi: 10.1016/S0140-6736(20)30925-9
2. Mumtaz W, Malik AS, Yasin MAM, and Xia L. Review on EEG and ERP predictive biomarkers for major depressive disorder. Biomed Signal Process Control. (2015) 22:85–98. doi: 10.1016/j.bspc.2015.07.003
3. Kennis M, Gerritsen L, van Dalen M, Williams A, Cuijpers P, and Bockting C. Prospective biomarkers of major depressive disorder: a systematic review and meta-analysis. Mol Psychiatry. (2020) 25:321–38. doi: 10.1038/s41380-019-0585-z
4. Friston KJ. Functional and effective connectivity in neuroimaging: A synthesis. Hum Brain Mapp. (1994) 2:56–78. doi: 10.1002/hbm.460020107
5. Simmatis L, Russo EE, Geraci J, Harmsen IE, and Samuel N. Technical and clinical considerations for electroencephalography-based biomarkers for major depressive disorder. NPJ Ment Health Res. (2023) 2:18. doi: 10.1038/s44184-023-00038-7
6. Paluš M, Kathpalia A, and Brunovský M. (2023). EEG connectivity in treatment of major depressive disorder: Tackling the conductivity effects, in: 2023 14th International Conference on Measurement. Bratislava: Institute of Measurement Science, Slovak Academy of Sciences. pp. 76–9. doi: 10.23919/MEASUREMENT59122.2023.10164615
7. Khan DM, Masroor K, Jailani MFM, Yahya N, Yusoff MZ, and Khan SM. Development of wavelet coherence EEG as a biomarker for diagnosis of major depressive disorder. IEEE Sensors J. (2022) 22:4315–25. doi: 10.1109/JSEN.2022.3143176
8. Ho SI, Lin IM, Hsieh JC, and Yen CF. EEG coherences of the default mode network among patients comorbid with major depressive disorder and anxiety symptoms. J Affect Disord. (2024) 361:728–38. doi: 10.1016/j.jad.2024.06.041
9. Bares M, Brunovsky M, Kopecek M, Stopkova P, Novak T, Kozeny J, et al. Changes in QEEG prefrontal cordance as a predictor of response to antidepressants in patients with treatment resistant depressive disorder: A pilot study. J Psychiatr Res. (2007) 41:319–25. doi: 10.1016/j.jpsychires.2006.06.005
10. Cook IA, Leuchter AF, Morgan M, Witte E, Stubbeman WF, Abrams M, et al. Early changes in prefrontal activity characterize clinical responders to antidepressants. Neuropsychopharmacology. (2002) 27:120–31. doi: 10.1016/S0893-133X(02)00294-4
11. Liu X, Zhang H, Cui Y, Zhao T, Wang B, Xie X, et al. EEG-based major depressive disorder recognition by neural oscillation and asymmetry. Front Neurosci. (2024) 18:1362111. doi: 10.3389/fnins.2024.1362111
12. Bhattacharjee M, Kathpalia A, Brunovský M, and Paluš M. (2023). Phase-based causality analysis of EEG in treatment of major depressive disorder, in: 2023 14th International Conference on Measurement. Bratislava: Institute of Measurement Science, Slovak Academy of Sciences. pp. 84–7. doi: 10.23919/MEASUREMENT59122.2023.10164427
13. Hasanzadeh F, Mohebbi M, and Rostami R. Analysis of EEG-derived brain networks for predicting rTMS treatment outcomes in MDD patients. Biomed Signal Process Control. (2024) 96:106613. doi: 10.1016/j.bspc.2024.106613
14. Zhongwen J, Lihan T, Jidong L, Linhong D, and Ling Z. Depression-induced changes in directed functional brain networks: A source-space resting-state EEG study. Math Biosci Eng. (2024) 21:7124–38. doi: 10.3934/mbe.2024315
15. Sun C, Yang F, Wang C, Wang Z, Zhang Y, Ming D, et al. Mutual information-based brain network analysis in post-stroke patients with different levels of depression. Front Hum Neurosci. (2018) 12:285. doi: 10.3389/fnhum.2018.00285
16. Hlaváčková-Schindler K, Paluš M, Vejmelka M, and Bhattacharya J. Causality detection based on information-theoretic approaches in time series analysis. Phys Rep. (2007) 441:1–46. doi: 10.1016/j.physrep.2006.12.004
17. Bossomaier T, Barnett L, Harré M, and Lizier JT. An Introduction to Transfer Entropy: Information Flow in Complex Systems. Cham, Switzerland: Springer International Publishing (2016). doi: 10.1007/978-3-319-43222-9
18. Wibral M, Vicente R, and Lindner M. Transfer entropy in neuroscience. In: Wibral M, Vicente R, and Lizier JT, editors. Directed Information Measures in Neuroscience. Springer, Berlin, Heidelberg (2014). p. 3–36. doi: 10.1007/978-3-642-54474-31
19. Vlachos I, Kugiumtzis D, and Paluš M. Phase-based causality analysis with partial mutual information from mixed embedding. Chaos: Interdiscip J Nonlinear Sci. (2022) 32:053111. doi: 10.1063/5.0087910
20. Vlachos I and Kugiumtzis D. Nonuniform state-space reconstruction and coupling detection. Phys Rev E. (2010) 82:16207. doi: 10.1103/PhysRevE.82.016207
21. Kuramoto Y. Chemical Oscillations, Waves, and Turbulence Vol. 19. Springer Series in Synergetics. Berlin, Heidelberg: Springer-Verlag (1984). doi: 10.1007/978-3-642-69689-3
22. Vlachos I, Kugiumtzis D, and Paluš M. Causality from phases of high-dimensional nonlinear systems. Inf Sci. (2025) 697:121761. doi: 10.1016/j.ins.2024.121761
23. Paluš M and Stefanovska A. Direction of coupling from phases of interacting oscillators: An information theoretic approach. Phys Rev E. (2003) 67:55201. doi: 10.1103/PhysRevE.67.055201
24. Oppenheim AV and Schafer RW. Discrete-Time Signal Processing. 3rd edn. Cambridge, USA: Prentice Hall Press (2009).
25. Pikovsky A, Rosenblum M, and Kurths J. Synchronization: A Universal Concept in Nonlinear Sciences. Cambridge Nonlinear Science Series. Cambridge, UK: Cambridge University Press (2001).
26. Rössler O. An equation for continuous chaos. Phys Lett A. (1976) 57:397–8. doi: 10.1016/0375-9601(76)90101-8
27. Osipov GV, Pikovsky AS, Rosenblum MG, and Kurths J. Phase synchronization effects in a lattice of nonidentical rössler oscillators. Phys Rev E. (1997) 55:2353–61. doi: 10.1103/PhysRevE.55.2353
28. Montgomery SA and Å sberg M. A new depression scale designed to be sensitive to change. Br J Psychiatry. (1979) 134:382–9. doi: 10.1192/bjp.134.4.382
29. Williams JBW and Kobak KA. Development and reliability of a structured interview guide for the montgomery-åsberg depression rating scale (SIGMA). Br J Psychiatry. (2008) 192:52–8. doi: 10.1192/bjp.bp.106.032532
30. Bares M, Kopecek M, Novak T, Stopkova P, Sos P, Kozeny J, et al. Low frequency (1-Hz), right prefrontal repetitive transcranial magnetic stimulation (rTMS) compared with venlafaxine ER in the treatment of resistant depression: A double-blind, single-centre, randomized study. J Affect Disord. (2009) 118:94–100. doi: 10.1016/j.jad.2009.01.032
31. Bares M, Brunovsky M, Novak T, Kopecek M, Stopkova P, Sos P, et al. The change of prefrontal QEEG theta cordance as a predictor of response to bupropion treatment in patients who had failed to respond to previous antidepressant treatments. Eur Neuropsychopharmacol. (2010) 20:459–66. doi: 10.1016/j.euroneuro.2010.03.007
32. Bares M, Brunovsky M, Novak T, Kopecek M, Stopkova P, Sos P, et al. QEEG theta cordance in the prediction of treatment outcome to prefrontal repetitive transcranial magnetic stimulation or venlafaxine ER in patients with major depressive disorder. Clin EEG Neurosci. (2015) 46:73–80. doi: 10.1177/1550059413520442
33. Bares M, Novak T, Kopecek M, Brunovsky M, Stopkova P, and Höschl C. The effectiveness of prefrontal theta cordance and early reduction of depressive symptoms in the prediction of antidepressant treatment outcome in patients with resistant depression: analysis of naturalistic data. Eur Arch Psychiatry Clin Neurosci. (2015) 265:73–82. doi: 10.1007/s00406-014-0506-8
34. Bares M, Novak T, Vlcek P, Hejzlar M, and Brunovsky M. Early change of prefrontal theta cordance and occipital alpha asymmetry in the prediction of responses to antidepressants. Int J Psychophysiol. (2019) 143:1–8. doi: 10.1016/j.ijpsycho.2019.06.006
35. Vlcek P, Bares M, Novak T, and Brunovsky M. Baseline difference in quantitative electroencephalography variables between responders and non-responders to low-frequency repetitive transcranial magnetic stimulation in depression. Front Psychiatry. (2020) 11:83. doi: 10.3389/fpsyt.2020.00083
36. Bares M, Brunovsky M, Stopkova P, Hejzlar M, and Novak T. Transcranial direct-current stimulation (tDCS) versus venlafaxine ER in the treatment of depression: A randomized, double-blind, single-center study with open-label, follow-up. Neuropsychiatr Dis Treat. (2019) 15:3003–14. doi: 10.2147/NDT.S226577
37. Jasper H. The ten-twenty electrode system of the international federation. Electroencephalography Clin Neurophysiol. (1958) 10:371–5.
38. Davoodi A, Holeňa M, Brunovský M, Kathpalia A, Hlinka J, Bareš M, et al. Response prediction of antidepressants: Using graph theory tools for brain network connectivity analysis. Biomed Signal Process Control. (2025) 103:107362. doi: 10.1016/j.bspc.2024.107362
39. Davis CS. Statistical Methods for the Analysis of Repeated Measurements. New York: Springer (2002).
40. Cohen J. Statistical power analysis for the behavioral sciences. New York, USA: Routledge (2013).
41. Kreyszig E. Introductory Mathematical Statistics. New York, USA: John Wiley & Sons (1970). p. 206.
42. García-Pérez MA. Use and misuse of corrections for multiple testing. Methods Psychol. (2023) 8:100120. doi: 10.1016/j.metip.2023.100120
43. Rubin M. Inconsistent multiple testing corrections: The fallacy of using family-based error rates to make inferences about individual hypotheses. Methods Psychol. (2024) 10:100140. doi: 10.1016/j.metip.2024.100140
44. Barnett MJ, Doroudgar S, Khosraviani V, and Ip EJ. Multiple comparisons: To compare or not to compare, that is the question. Res Soc Administrative Pharm. (2022) 18:2331–4. doi: 10.1016/j.sapharm.2021.07.006
45. Benjamini Y and Yekutieli D. The control of the false discovery rate in multiple testing under dependency. Ann Stat. (2001) 29:1165–88. doi: 10.1214/aos/1013699998
46. Harmer CJ, Duman RS, and Cowen PJ. How do antidepressants work? new perspectives for refining future treatment approaches. Lancet Psychiatry. (2017) 4:409–18. doi: 10.1016/S2215-0366(17)30015-9
47. Chervyakov AV, Chernyavsky AY, Sinitsyn DO, and Piradov MA. Possible mechanisms underlying the therapeutic effects of transcranial magnetic stimulation. Front Hum Neurosci. (2015) 9:303. doi: 10.3389/fnhum.2015.00303
48. Yamada Y and Sumiyoshi T. Neurobiological mechanisms of transcranial direct current stimulation for psychiatric disorders; neurophysiological, chemical, and anatomical considerations. Front Hum Neurosci. (2021) 15:631838. doi: 10.3389/fnhum.2021.631838
49. Nissen TD, Laursen B, Viardot G, l’Hostis P, Danjou P, Sluth LB, et al. Effects of vortioxetine and escitalopram on electroencephalographic recordings – a randomized, crossover trial in healthy males. Neuroscience. (2020) 424:172–81. doi: 10.1016/j.neuroscience.2019.09.039
50. Olbrich S, Tränkner A, Chittka T, Hegerl U, and Schönknecht P. Functional connectivity in major depression: Increased phase synchronization between frontal cortical EEG-source estimates. Psychiatry Research: Neuroimaging. (2014) 222:91–9. doi: 10.1016/j.pscychresns.2014.02.010
51. Fingelkurts AA, Fingelkurts AA, Rytsälä H, Suominen K, Isometsä E, and Kähkönen S. Composition of brain oscillations in ongoing EEG during major depression disorder. Neurosci Res. (2006) 56:133–44. doi: 10.1016/j.neures.2006.06.006
52. Fingelkurts AA and Fingelkurts AA. Altered structure of dynamic electroencephalogram oscillatory pattern in major depression. Biol Psychiatry. (2015) 77:1050–60. doi: 10.1016/j.biopsych.2014.12.011
53. Hecht D. Depression and the hyperactive right-hemisphere. Neurosci Res. (2010) 68:77–87. doi: 10.1016/j.neures.2010.06.013
54. Nobakhsh B, Shalbaf A, Rostami R, Kazemi R, Rezaei E, and Shalbaf R. An effective brain connectivity technique to predict repetitive transcranial magnetic stimulation outcome for major depressive disorder patients using EEG signals. Phys Eng Sci Med. (2023) 46:67–81. doi: 10.1007/s13246-022-01198-0
55. Engel AK and Fries P. Beta-band oscillations—signalling the status quo? Curr Opin Neurobiol. (2010) 20:156–65. doi: 10.1016/j.conb.2010.02.015
56. Jensen O, Goel P, Kopell N, Pohja M, Hari R, and Ermentrout B. On the human sensorimotor-cortex beta rhythm: Sources and modeling. NeuroImage. (2005) 26:347–55. doi: 10.1016/j.neuroimage.2005.02.008
57. Knyazev GG. EEG delta oscillations as a correlate of basic homeostatic and motivational processes. Neurosci Biobehav Rev. (2012) 36:677–95. doi: 10.1016/j.neubiorev.2011.10.002
58. Harmony T. The functional significance of delta oscillations in cognitive processing. Front Integr Neurosci. (2013) 7:3389/fnint.2013.00083. doi: 10.3389/fnint.2013.00083
59. Herrmann CS, Strüber D, Helfrich RF, and Engel AK. EEG oscillations: From correlation to causality. Int J Psychophysiol. (2016) 103:12–21. doi: 10.1016/j.ijpsycho.2015.02.003
60. Riddle J, Scimeca JM, Cellier D, Dhanani S, and D’Esposito M. Causal evidence for a role of theta and alpha oscillations in the control of working memory. Curr Biol. (2020) 30:1748–1754.e4. doi: 10.1016/j.cub.2020.02.065
61. Pollock V and Schneider L. Quantitative, waking EEG research on depression. Biol Psychiatry. (1990) 27:757–80. doi: 10.1016/0006-3223(90)90591-O
62. Liu H, Xia Y, Wang X, Cai S, Hua L, Yao Z, et al. Large-scale neural network correlates of response inhibition associated with antidepressant response in major depressive disorder. BMC Psychiatry. (2025) 25:866. doi: 10.1186/s12888-025-07286-1
63. Grimm S, Beck J, Schuepbach D, Hell D, Boesiger P, Bermpohl F, et al. Imbalance between left and right dorsolateral prefrontal cortex in major depression is linked to negative emotional judgment: An fMRI study in severe major depressive disorder. Biol Psychiatry. (2008) 63:369–76. doi: 10.1016/j.biopsych.2007.05.033
64. van der Vinne N, Vollebregt MA, van Putten MJ, and Arns M. Frontal alpha asymmetry as a diagnostic marker in depression: Fact or fiction? a meta-analysis. NeuroImage: Clin. (2017) 16:79–87. doi: 10.1016/j.nicl.2017.07.006
65. Disner SG, Beevers CG, Haigh EAP, and Beck AT. Neural mechanisms of the cognitive model of depression. Nat Rev Neurosci. (2011) 12:467–77. doi: 10.1038/nrn3027
66. Kaiser RH, Andrews-Hanna JR, Wager TD, and Pizzagalli DA. Large-scale network dysfunction in major depressive disorder: A meta-analysis of resting-state functional connectivity. JAMA Psychiatry. (2015) 72:603–11. doi: 10.1001/jamapsychiatry.2015.0071
67. Dichter GS, Gibbs D, and Smoski MJ. A systematic review of relations between resting-state functional-MRI and treatment response in major depressive disorder. J Affect Disord. (2015) 172:8–17. doi: 10.1016/j.jad.2014.09.028
Keywords: brain, causality, electroencephalography, major depressive disorder, mutual information
Citation: Bhattacharjee M, Vlachos I, Kathpalia A, Hlinka J, Brunovský M, Bareš M and Paluš M (2026) Brain causality alterations in major depressive disorder treatment. Front. Psychiatry 16:1718216. doi: 10.3389/fpsyt.2025.1718216
Received: 03 October 2025; Accepted: 12 December 2025; Revised: 11 December 2025;
Published: 27 January 2026.
Edited by:
Luca Tarasi, University of Bologna, ItalyReviewed by:
Enkhmurun Chibaatar, University of Occupational and Environmental Health Japan, JapanYingtan Wang, Capital Medical University, China
Copyright © 2026 Bhattacharjee, Vlachos, Kathpalia, Hlinka, Brunovský, Bareš and Paluš. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Madhurima Bhattacharjee, YmhhdHRhY2hhcmplZUBjcy5jYXMuY3o=
†Present address: Aditi Kathpalia, Department of Applied Mechanics and Biomedical Engineering, Indian Institute of Technology Madras, Chennai, India