Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Psychiatry, 11 February 2026

Sec. Mood Disorders

Volume 17 - 2026 | https://doi.org/10.3389/fpsyt.2026.1769246

This article is part of the Research TopicTreatment Resistant Depression (TRD): epidemiology, clinic, burden and treatment, vol IIView all 9 articles

Episode-wide Maudsley staging in treatment-resistant depression: a longitudinal tertiary-care comparison of dimensional and categorical outcomes

  • 12nd Department of Psychiatry, National and Kapodistrian University of Athens, “Attikon” University General Hospital, Athens, Greece
  • 2Department of Brain Sciences, Faculty of Medicine, Imperial College London, London, United Kingdom
  • 3West London NHS Trust, London, United Kingdom
  • 41st Department of Psychiatry, National and Kapodistrian University of Athens, Eginition Hospital, Athens, Greece
  • 5Université Libre de Bruxelles, Brussels, Belgium
  • 6Department of Medicine and Surgery, Kore University of Enna, Enna, Italy
  • 7Oasi Research Institute-IRCCS, Troina, Italy
  • 8Institute of Psychiatry, Psychology & Neuroscience, King’s College London, London, United Kingdom
  • 9South London and Maudsley NHS Foundation Trust, London, United Kingdom

Background: Unlike typical categorical definitions of treatment-resistant depression (TRD) based on antidepressant (AD) trial failures, the Maudsley Staging Method (MSM) is a dimensional measure of resistance also rating episode duration, baseline depression severity and failure of augmentation strategies and ECT. However, MSM has not previously been used longitudinally across the full depressive episode, from episode onset to remission. We applied episode-wide MSM (EW-MSM) in patients with Major Depressive Disorder naturalistically followed to remission in a tertiary-care setting and categorized as 1st and 2nd AD trial remitters, TRD remitters and TRD non-remitters (categorical outcomes). The study aimed to investigate clinicodemographic and treatment-related correlates of EW-MSM and categorical outcomes, comparatively assess their predictive value for depression improvement, and explore the discriminative utility of EW-MSM across categorical outcomes.

Methods: We recruited 267 patients. EW-MSM was scored at remission (MADRS ≤ 9 in two consecutive visits), if achieved. Associations of clinicodemographic and treatment-related characteristics with EW-MSM and categorical outcomes were explored in multivariate models. Their comparative predictive value for depression improvement was tested in hierarchical linear regressions. ROC curves assessed EW-MSM discriminative utility across categorical outcomes.

Results: Analysis focused on 233 remitters (105 1st AD trial remitters, 62 2nd AD trial remitters and 66 TRD remitters). Both EW-MSM and categorical outcomes were associated with baseline severity, obsessive-compulsive disorder comorbidity, episode duration, number of ADs or AD combination. EW-MSM was additionally associated with psychotic features and use of augmentation strategies. EW-MSM was more strongly associated with baseline severity than categorical outcomes (R2 = 0.24 vs. 0.14) and consequently predicted depression improvement to remission more efficiently in hierarchical regressions (ΔR2 = 0.13 vs. 0.05). EW-MSM discriminated well TRD remitters from 1st AD trial remitters (AUC = 0.91, 95% CI = 0.86-0.95) or from early remitters combined (AUC = 0.865, 95% CI = 0.81-0.92) but not among adjacent categories.

Limitations: Single, tertiary-care setting, unavailability of ECT/esketamine and exclusion of patients on psychotherapy (not rated by MSM) limit study generalizability.

Conclusions: EW-MSM reflects clinical and treatment intensity aspects of resistance in depression more strongly and comprehensively than categorical outcomes. Therefore, it offers precision for staging and could be used to more efficiently investigate clinical, biological and psychological correlates of resistance.

1 Introduction

Major Depressive Disorder (MDD) is among the most prevalent and disabling psychiatric conditions worldwide (1). Despite numerous available evidence-based pharmacological treatments, a substantial proportion of patients do not achieve satisfactory outcomes and develop treatment-resistant depression (TRD) (2, 3). Compared to MDD, TRD is associated with higher rates of psychiatric and physical comorbidities, greater functional impairment, increased likelihood of hospitalization, longer hospital stays, higher healthcare costs, and an elevated risk of suicidality (411).

Despite its clinical relevance, a consensual operational definition and diagnostic criteria for TRD are lacking (1214). There is no solid evidence validating any meaningful cut-off based on the number of treatment failures for defining a binary TRD phenotype and a spectrum of resistance seems more plausible (15, 16). Yet, according to the European Medicines Agency (EMA), TRD is considered when treatment with at least two different antidepressant agents (of the same or a different class) of adequate dosage and duration shows lack of clinically meaningful improvement (17). Based on this statement, TRD is frequently defined as failure to respond to two adequate antidepressant (AD) trials. Using this definition, approximately 30% of patients with MDD develop TRD (2, 6).

However, not all responders finally achieve the ultimate therapeutic goal, which is clinical remission, i.e. near-absence of depressive symptoms (18). Therefore, if TRD is defined as lack of clinical remission rather than lack of response after two AD trials, its estimated prevalence rises considerably. For example, in the STAR*D study (19), after the first two trials around 45% of patients cumulatively failed to reach remission (20). Full restoration of psychosocial and cognitive functioning (functional recovery), recently recognized as the optimal goal of treatment, typically takes much longer to achieve, and its measurement varies widely across settings (21). Therefore, clinical remission, signifying the end of the depressive episode, appears as a reasonable preferable clinical anchor on which definition of resistance should be based (22).

Apart from binary definitions of TRD, such as lack of response or remission after two AD trials, which are often used in clinical practice as they are simpler and adopted by regulation authorities, dimensional and staging models have also been proposed to describe multiple dimensions of treatment resistance or gradations in its severity, respectively, rather than simply presence or absence of treatment resistance (13, 14, 23). Staging frameworks include the Thase and Rush model (24), which describes five stages of resistance depending on the pharmacological AD classes sequentially failed and assuming a hierarchy of ADs, and the European Staging Model (ESM), proposed by the Group for the Study of Resistant Depression (GSRD), which describes three main stages of resistance depending on the number and duration of failed AD trials (25, 26).

The Maudsley Staging Method (MSM) is both a dimensional and a staging tool to assess treatment resistance in MDD (27). In addition to AD failures, MSM also rates failure of any augmentation strategies and electroconvulsive therapy (ECT). Unlike earlier staging models, MSM uses lack of remission rather than lack of response to define treatment failures (22). Moreover, it bases assessment of resistance not only on treatment failures but also two additional dimensions, duration of current episode and baseline depression severity. The total MSM score also allows for the classification of patients into mild, moderate or severe stages of resistance. The model has been validated in multiple studies, demonstrating its predictive value for poor prospective clinical outcomes, including increased functional impairment, time spent with depression and lower chance of remission (22, 2832). By integrating both treatment intensity and clinical burden (depression severity and episode duration), this remission-based dimensional staging model provides additional benefits for treatment resistance quantification both in research and clinical settings.

Despite growing interest in TRD, there is a notable lack of longitudinal, real-world studies following up TRD patients to remission in naturalistic settings. Most existing studies prioritize treatment response as the primary outcome and often have limited follow-up durations, rarely exceeding two years. Regarding MSM, it has been applied only at random snapshots, e.g. at hospital discharge, to predict short- or long-term outcomes. To date, no studies have used MSM to quantify the full continuum of treatment resistance across the Major Depressive Episode (MDE) ending at remission, i.e. episode-wide. Capturing the total burden of treatment failures and time elapsing from episode onset to remission as well as baseline clinical severity in a dimensional framework, such as an episode-wide version of MSM, produces an accurate, continuous index of resistance observed over the whole episode. This index is not an estimate or prediction but a post-hoc record of all treatment efforts required and time needed to reach remission given the initial severity. Therefore, it can serve as a gold standard to compare with yet non-validated categorical classifications, such as remitters, non-remitters or TRD. Furthermore, it might be tested for correlation with candidate biological, clinical or psychological predictors.

This prospective observational study included adult patients with a moderate to severe MDE in the context of MDD who were naturalistically followed up at a specialized tertiary-care center aiming at remission. It was part of the ATLADIS (AThens Longitudinal Affective DIsorders Study) prospective study also including patients with bipolar disorder (33, 34). Using the GSRD/ESM categorical TRD definition but adopting a remission-based framework, patients were categorized as 1st AD trial remitters, 2nd AD trial remitters, TRD remitters and TRD non-remitters (categorical outcomes). Episode-wide MSM (EW-MSM) was rated at the end of the MDE (on remission, if achieved) to produce a continuous score of resistance of the MDE from its beginning to its end. The study aimed to:

1. Examine the association of sociodemographic and clinical characteristics, baseline mood severity, and current MDE treatment parameters with categorical outcomes and EW-MSM scores.

2. Compare the predictive value of EW-MSM versus categorical outcomes for improvement in depression severity.

3. Evaluate the discriminative utility of EW-MSM across categorical outcomes.

2 Methods

2.1 Participants

Included patients were diagnosed with MDD as the primary psychiatric diagnosis and suffered from a current MDE, according to Diagnostic and Statistical Manual of Mental Disorders (DSM)-5 criteria, as assessed with the Structured Clinical Interview for DSM-5 Disorders - Clinician Version (SCID-5-CV) by an experienced psychiatrist (35). An at least moderate severity of the MDE was required for inclusion as documented by a Montgomery Åsberg Depression Rating Scale (MADRS) (36) score ≥20 (37). Patients were recruited from the specialized tertiary-care Affective Disorders and Suicidality outpatient clinic at the 2nd Department of Psychiatry, Medical School of National and Kapodistrian University of Athens, located in Attikon University Hospital, Athens, Greece between January 2014 and June 2023. Follow-up at the clinic started either before or after the onset of the current MDE. Patients with neurocognitive disorders (delirium, dementia, traumatic brain injury, etc.), intellectual disability (IQ<70), lack of fluency in Greek, severe comorbid psychiatric disorders (alcohol and substance abuse during the last 6 months, severe personality disorder) and severe uncontrolled medical conditions (e.g. neoplasms) were excluded. As MSM does not rate 0 ADs, only patients treated with at least one AD at an adequate dose for a minimum of four weeks were included. As MSM does not include psychotherapy, patients who received psychotherapy for the current episode (independent of whether they did so in previous episodes) were also excluded.

The study protocol was approved by the local Ethics Committee and all procedures adhered to the principles of the Declaration of Helsinki. Written informed consent was obtained from all participants.

2.2 Treatment procedures

Patients were naturalistically followed up every 1–3 months, or more often when clinically justified (suicidality, poor AD tolerability, hypomanic switch). Monotherapy, combination therapy and augmentation strategies were freely allowed based on clinical judgement and established guidelines, aiming to achieve remission. ECT, repetitive transcranial magnetic stimulation (rTMS) or intranasal esketamine were not available in our setting. ADs were recorded only if administered for at least four weeks on at least the minimum recommended dose(s). An AD trial was defined as a period of treatment with ADs, one or more (AD combination trial), of at least minimum dosage and duration, and any augmentation agents. Augmentation strategies, typically involving the addition of antipsychotics and mood stabilizers, were recorded if lasting at least four weeks and were considered part of an AD trial rather than a separate trial. The addition of benzodiazepines or hypnotics was not classified as augmentation.

Criteria of unsatisfactory outcome triggering decisions to change treatment (dose optimization, augmentation, combination or switching) consistently included:

- No response (<25% reduction in MADRS score) after ≥4 weeks of adequate treatment.

- Partial response (25–49% reduction in MADRS score) after ≥6 weeks of adequate treatment.

- Lack of remission (MADRS ≥10) after ≥12 weeks of adequate treatment.

- Earlier treatment modifications in cases of poor tolerability.

However, treatment decisions were individualized, with no predefined criteria guiding the selection of specific medications or treatment adjustments.

All administered treatments after the onset of the current MDE were thoroughly recorded for the whole episode and also specifically extracted for the first and last AD trials. Current MDE treatment characteristics (number of ADs and AD trials used, AD combinations and augmentation strategies employed) were extracted. The maximum AD doses received for at least 4 weeks episode-wise and in first and last AD trials were also extracted. AD doses were also converted to fluoxetine dose equivalents (38, 39).

The study ended in June 2024. For any individual patient, participation in the study was completed when remission was reached or, before remission was reached, when the study ended or the patient dropped out. The last two options were applicable only for TRD patients, i.e., those with 3 or more AD trials. Hence, patients who ceased follow-up early, after the 1st or 2nd AD trial without achieving remission were excluded from the analysis. The total duration of current MDE was determined from episode onset to remission, study end if earlier or drop out.

2.3 Clinical assessments

Clinicodemographic (sociodemographic and clinical) data were obtained through semi-structured clinical interviews conducted by an experienced psychiatrist or clinical psychologist. Follow-up of the current MDE always started after its onset, which was determined based on patient self-reports, caregivers’ information and medical records after assessing their reliability; conflicts were resolved by contacting patients’ former therapists. Baseline MADRS and Young Mania Rating Scale (YMRS) (40) scores were retrospectively assessed (R-MADRS, R-YMRS) to evaluate symptom severity at episode onset using the same procedures. MADRS was also rated when follow-up of the current MDE started (First evaluation MADRS). Mood state monitoring was primarily based on the Clinical Global Impressions Scale – Severity (CGI-S) (41), with optional MADRS assessments, when justified by clinical circumstances, and YMRS assessments, when hypomania was suspected (YMRS ≥ 12). When CGI-S reached 1 or 2, remission was tested using MADRS (≤9) and YMRS (≤6) cut-offs in two consecutive visits (42, 43). Δ-MADRS was calculated as baseline MADRS (R-MADRS) – remission MADRS.

MSM (27) assesses three core dimensions of treatment resistance: duration of current episode (acute, sub-acute or chronic; 1–3 points), depressive symptom severity (subsyndromal, mild, moderate, severe without or with psychosis; 1–5 points) and the extent of treatment failure (1–7 points), which includes the number of AD failures (1–5 points), as well as whether augmentation strategies and ECT were used (1 point each if used). Scoring of the ECT item was not used in this study as ECT was not available in our setting. Apart from the scores of the three dimensions, the total MSM score, ranging from 3 to 15, is a single, continuous, dimensional measure of resistance, also allowing for the classification of patients into mild (scores 3-6), moderate (scores 7-10) or severe (scores 11-15) stages of resistance. In our study, EW-MSM was scored at the end of the episode, upon remission. Of note, scoring was based on the total number of ADs received during the episode, rather than treatment failures alone, as previously applied (29).

Furthermore, categorical outcomes were defined after modifying the GSRD categorical definition, distinguishing participants into 1st AD trial remitters, 2nd AD trial remitters, TRD remitters (after at least three AD trials) and TRD non-remitters (26).

2.4 Statistical analyses

Descriptive statistics were calculated for all variables in the total sample and across categorical outcomes. Normality was assessed using Shapiro – Wilk test. Univariate comparisons between groups were performed using Kruskal-Wallis, Pearson’s chi-square or Fisher’s exact test as appropriate. Univariate linear regressions were also conducted for EW-MSM total score to examine its associations with clinicodemographic characteristics, baseline mood severity and current MDE treatment characteristics.

Variables with p<0.1 in univariate comparisons were included in multivariate multinomial logistic regression models, assessing categorical outcomes in relation to clinicodemographic characteristics and baseline mood severity as well as current MDE treatment characteristics. Additionally, multiple linear regression models were constructed to examine the relationship between EW-MSM total score and the same sets of predictors, incorporating variables identified in univariate regressions (p<0.1). The variance explained by final multivariate models was estimated by R2 in linear regression models and by Cox-Snell pseudo-R2 in logistic regression models.

Hierarchical linear regression analyses were conducted to assess the added explanatory value of EW-MSM total score versus categorical outcomes while predicting Δ-MADRS, adjusting for sex and age. Inference was based on model comparisons.

The potential of EW-MSM in discriminating between categorical outcomes was investigated in Receiver Operating Characteristic (ROC) curves analyses, using categorical outcomes as reference variables and EW-MSM total score as classifier. The areas under the curves (AUCs) and their 95% CI were computed.

All statistical analyses were performed using STATA 14.0 (44).

3 Results

3.1 Study flow

No patients refused to participate. A total of 295 participants were initially enrolled. Twelve participants received adjunctive psychotherapy during the current episode and were hence excluded. Two additional patients were excluded as they received only quetiapine but no AD during the current episode. Furthermore, 11 individuals discontinued follow-up after the 1st or 2nd AD trial before achieving remission, therefore they could not be classified into any categorical outcome. Three individuals experienced a hypomanic episode (‘switch’) during AD treatment, necessitating a revision of their diagnosis. All 14 aforementioned individuals were excluded from the study, leaving 267 eligible participants (Figure 1). Patients on psychotherapy were not significantly different from the finally eligible participants on gender (p = 0.85), age (p = 0.38), educational attainment (p = 0.70) and R-MADRS (p = 0.31). At study end, no TRD patient had dropped out and 233 out of 267 (87.3%) patients (all 167 non-TRD and 66 TRD patients) had achieved remission. Of note, 105 participants (39.3%) were also included in the GSRD TRD-III (45) and TRD-IV (46) studies.

Figure 1
Flowchart showing study group allocation from an initial sample of 295 participants, with 28 excluded for various reasons. Of the remaining, 233 remitters were divided into group A (first antidepressant trial remitters, N equals 105) and group B (second antidepressant trial remitters, N equals 62). One hundred treatment-resistant depression patients were divided into group C1 (remitters, N equals 66) and group C2 (non-remitters, N equals 34).

Figure 1. Overall participant flow. AD, Antidepressant; TRD, Treatment-resistant Depression.

The primary analyses focused on the 233 participants who achieved remission following one (group A, 1st AD trial remitters: N = 105, 45.1%), two (group B, 2nd AD trial remitters: N = 62, 26.6%) and three or more AD trials (group C1, TRD remitters: N = 66, 28.3%). Thirty-four TRD participants did not reach remission by the end of the study (group C2, TRD non-remitters). TRD remitters (C1) and TRD non-remitters (C2) did not significantly differ on clinicodemographic characteristics, baseline mood severity (R-MADRS, R-YMRS) and most current MDE treatment characteristics (Supplementary Tables 1, 2). The two groups differed, however, in that TRD non-remitters received treatment for longer (p = 0.004) but had a lower likelihood of having received AD combination treatment (p = 0.021).

3.2 Descriptives of the total remitters sample (N = 233)

Mean age of remitters (N = 233) was 52.3 ± 12.5 years and 62.7% were females. Among them, 73% suffered from recurrent MDD, while 63.9% and 23.6% exhibited lifetime melancholic and psychotic features, respectively. Furthermore, 30.9% had experienced a comorbid anxiety disorder and 51.5% had lifetime suicide attempts (Supplementary Table 3). Mean R-MADRS was 40.6 ± 5.7, mean remission MADRS was 5.5 ± 2.5 and mean Δ-MADRS was 35.1 ± 5.8 (Table 1, Supplementary Table 3). Median current MDE duration was 7 (range 2-120) months (Table 2). In those starting follow-up at our clinic after current MDE onset (N = 154, 66.1%), median time of initiation of follow-up of the current MDE was 3 (range 1-96) months after current MDE onset, while in those already followed up at our clinic before current MDE onset (N = 79, 33.9%), follow-up of the current MDE started one month (N = 39) or earlier after current MDE onset. Finally, 82% of remitters were hospitalized during their current MDE.

Table 1
www.frontiersin.org

Table 1. Descriptive statistics and univariate analyses of clinicodemographic data and depression severity ratings in remitters (N = 233).

Table 2
www.frontiersin.org

Table 2. Descriptive statistics and univariate analyses of current MDE treatment characteristics in remitters (N = 233).

3.3 Aim 1: clinicodemographic characteristics, baseline mood severity and current MDE treatment parameters as predictors of EW-MSM and categorical outcomes

Clinicodemographic characteristics, mood severity ratings and current MDE treatment characteristics of remitters across categorical outcomes are presented in Supplementary Table 3 and Table 2, alongside results from univariate regressions of EW-MSM on the same predictors. Sex, age, and clinicodemographic characteristics and baseline mood ratings with p < 0.1 in the univariate analyses (comparisons and regressions), displayed in Table 1, were included as predictors of EW-MSM total score and categorical outcomes in multiple linear and multinomial logistic regression models, respectively (Table 3). Similar models were built with current MDE treatment characteristics with p<0.1 in univariate analyses (Table 3). Prior to model construction, collinearity tests were conducted, and the most informative predictors were retained. Lifetime and current episode psychotic features were collinear, and the latter was retained. R-MADRS and first evaluation MADRS were also collinear; the former was retained. Similarly, fluoxetine dose equivalents and number of AD trials were collinear with number of ADs used and were, therefore, excluded from multivariate models.

Table 3
www.frontiersin.org

Table 3. Multiple linear regressions of EW-MSM total score and multinomial logistic regressions of categorical outcomes in relation to clinicodemographic characteristics, baseline mood severity and current MDE treatment characteristics in remitters (N = 233).

Clinicodemographic characteristics and baseline mood severity accounted for 33% of the variance of EW-MSM, but only current psychotic features, lifetime OCD comorbidity and R-MADRS emerged as significant predictors. Clinicodemographic characteristics and baseline mood severity explained 20% of the variance of categorical outcomes, but only lifetime OCD comorbidity, family history of MDD in first-degree relatives and R-MADRS were identified as significant predictors of both 2nd AD trial remitters and TRD remitters relative to 1st AD trial remitters.

Current MDE treatment characteristics accounted for 76.7% of the variance of EW-MSM, with current MDE duration, total number of ADs received and augmentation in any trial being significant predictors. Finally, current MDE treatment characteristics explained 45.6% of the variance of categorical outcomes, with current MDE duration and AD combination in any trial being significant predictors.

Therefore, EW-MSM and categorical outcomes were predicted by shared and distinct sets of clinicodemographic and treatment-related characteristics in multivariate models. Predictions were always more efficient for EW-MSM.

3.4 Aim 2: comparing the predictive value of EW-MSM versus categorical outcomes for improvement in depression severity

A hierarchical regression analysis was conducted to examine the comparative predictive value of EW-MSM total score and categorical outcomes for Δ-MADRS scores (Table 4). In the first model, age and sex accounted for 5.1% of the variance in Δ-MADRS scores (R²=0.051, p = 0.003). Adding categorical outcomes in the second model significantly improved model fit (ΔR²=0.066, p < 0.001). In the third model, EW-MSM total score was introduced to the first model, leading to a substantial increase in explained variance (ΔR²=0.143, p < 0.001). The final model included both categorical outcomes and EW-MSM apart from sex and age and accounted for 24.5% of the variance in Δ-MADRS scores (p < 0.001). Compared to model 2, EW-MSM total score improved R² by 0.127 (p<0.001) in the final model. Compared to model 3, categorical outcomes provided an additional ΔR² of only 0.051 (p<0.001) in the final model. These findings highlight that the EW-MSM staging tool captures MADRS improvement to remission more effectively compared to categorical outcomes.

Table 4
www.frontiersin.org

Table 4. Hierarchical regression analysis of EW-MSM total score and categorical outcomes as predictors of Δ-MADRS scores in remitters (N = 233).

However, it should be noted that baseline depression severity (R-MADRS), embedded by definition in Δ-MADRS, is also included in EW-MSM as its severity dimension. As a result, R-MADRS was more strongly associated with EW-MSM (R2 = 0.241, p=1.55E-15) than with categorical outcomes (R2 = 0.137, p=4.18E-08), suggesting that EW-MSM superiority in predicting Δ-MADRS might be due to its stronger association with R-MADRS. This was formally tested by adjusting hierarchical regression models above for R-MADRS. Superiority of EW-MSM over categorical outcomes was lost after adjustment, corroborating our hypothesis.

3.5 Aim 3: discriminative utility of EW-MSM across categorical outcomes

Table 5 presents descriptives for EW-MSM total score and dimensions, examined both as continuous and categorical variables, along with their comparisons across categorical outcomes. TRD remitters (group C1) exhibited the highest EW-MSM duration, analyzed either way (p<0.001). Although EW-MSM severity showed no significant differences when analyzed as a continuous variable, significant differences emerged when examined categorically (p=0.002). Moderate severity was more prevalent in group A, whereas severe depression without psychotic features was more prevalent in group C1 than A. No significant differences were observed for severe depression with psychotic features. EW-MSM Antidepressants was highest in group C1, lowest in group A and intermediate in group B (p<0.001; C1>B>A). Statistically significant differences were also found in EW-MSM Augmentation (p=0.009; C1>A). EW-MSM ECT scored 0 in all groups as no patient had received ECT treatment. EW-MSM total scores clearly distinguished between groups (p<0.001; C1>B>A) (Figure 2). Categorical outcomes explained 42% of the variance of EW-MSM (R2 = 0.42, p < 0.001) and vice versa. When established cut-offs for severity of resistance (27) were applied, 20.6% exhibited mild resistance, 76.8% moderate resistance and 2.6% severe resistance. Mild resistance was more frequent in group A, moderate resistance in groups B and C1 while severe resistance in C1 (p< 0.001). In summary, EW-MSM was strongly but not perfectly correlated with categorical outcomes; a clear gradient of resistance across all three outcome categories was displayed only by the Antidepressants dimension and by EW-MSM total score.

Table 5
www.frontiersin.org

Table 5. Comparative analysis of EW-MSM dimensions as continuous and categorical variables across categorical outcomes in remitters (N = 233).

Figure 2
Violin plot comparing EW-MSM total scores among three groups: 1st antidepressant (AD) trial remitters, 2nd AD trial remitters, and treatment-resistant depression (TRD) remitters. TRD remitters show higher score distributions.

Figure 2. Violin plots of EW-MSM by remission-based categorical outcomes. AD, Antidepressant; EW-MSM, Episode-wide Maudsley Staging Method; TRD, Treatment-resistant depression.

ROC curve analyses were conducted to evaluate the discriminative utility of EW-MSM total scores for categorical outcomes (Figure 3). AUC values, sensitivity, specificity, accuracy (Acc, i.e., correct classification rates) and balanced accuracy (BAC) were calculated for various comparisons. The highest classification accuracy was observed in the comparison of the most distant categories A vs. C1 (AUC = 0.909, 95% CI = 0.86-0.95; Acc=81.9%, BAC = 83.0% at a cut-off score of 7/8). Comparison of the two last categories (B vs. C1, AUC = 0.790, 95% CI = 0.72-0.86) yielded an Acc=72.7% and BAC = 73% at the more rigorous cut-off of 8/9. Distinction between A and B was poor (AUC = 0.716, 95% CI = 0.64-0.79; Acc=68.9%, BAC = 65.7% at a cut-off of 7/8). Comparisons were also performed for combined groups. Comparison between A and the two last groups combined (A vs. B+C1) exhibited AUC = 0.815 (95% CI = 0.76-0.87) and Acc=74.3%, BAC = 74.6% at a cut-off of 7/8. Comparison between the two first groups combined and C1 (A+B vs. C1) exhibited AUC = 0.865 (95% CI = 0.81-0.92) and Acc=85.0%, BAC = 77.6% at a cut-off of 8/9. These findings suggest that EW-MSM total score effectively (AUC≥0.80) discriminated between the most distant categories (1st AD trial remitters vs. TRD remitters) and between either of these vs. all other categories combined but not among adjacent categories. The most efficient and reliable discrimination (AUC 95% CI beyond 0.80) was between TRD remitters and 1st AD trial remitters or 1st and 2nd AD trial remitters combined. However, these findings should be interpreted cautiously as the two measures partially share treatment exposure information (number of ADs in EW-MSM and number of AD trials in categorical outcomes, which are highly correlated), which can inflate discrimination performance.

Figure 3
Five receiver operating characteristic (ROC) curve graphs labeled A to E compare sensitivity versus one minus specificity for different group classifications, each displaying an area under the curve (AUC) value for diagnostic performance.

Figure 3. Comparative ROC Curve analyses for remission-based categorical outcomes using EW-MSM total score as classifier. (A) Comparison between group A vs. group B: AUC = 0.716; optimal cut-off=7/8: Sn=53.23%, Sp=78.10%, PPV = 58.93%, NPV = 73.87%, PLR = 2.43, NLR = 0.60, Acc=68.86%, BAC = 65.67%. (B) Comparison between group A vs. group C1: AUC = 0.909; optimal cut-off=7/8: Sn=87.88%, Sp=78.1%, PPV = 71.6%, NPV = 91.11%, PLR = 4.01, NLR = 0.155, Acc=81.87%, BAC = 82.99%. (C) Comparison between group B vs. C1: AUC = 0.79; optimal cut-off=8/9: Sn=60.61%, Sp=85.48%, PPV = 81.63%, NPV = 67.09%, PLR = 4.18, NLR = 0.46, Acc=72.66%, BAC = 73.05%. (D) Comparison between combined groups A and B vs. C1: AUC = 0.865; optimal cut-off=8/9: Sn=60.61%, Sp=94.61%, PPV = 81.63%, NPV = 85.87%, PLR = 11.25, NLR = 0.42, Acc=84.98%, BAC = 77.61%. (E) Comparison between group A vs. combined groups B and C1: AUC = 0.815; optimal cut-off=7/8: Sn=71.09%, Sp=78.1%, PPV = 79.82%, NPV = 68.91%, PLR = 3.25, NLR = 0.37, Acc=74.25%, BAC = 74.60%. Group definitions: group A = 1st AD trial remitters; group B = 2nd AD trial remitters; group C1 = TRD remitters; Acc, Accuracy; AD, Antidepressant; AUC, Area Under the Curve; BAC, Balanced Accuracy; EW-MSM, Episode-wide Maudsley Staging Method; NLR, Negative Likelihood Ratio; NPV, Negative Predictive Value; ROC Curve, Receiver Operating Characteristic Curve; PLR, Positive Likelihood Ratio; PPV, Positive Predictive Value; Sn, sensitivity; Sp, specificity; TRD, Treatment-resistant depression.

4 Discussion

This study is, to our knowledge, the first to apply EW-MSM, a dimensional score of treatment resistance recorded over the whole depressive episode, from episode onset to remission, as previous research only employed MSM at randomly timed snapshots, such as at hospital discharge, to predict short- or long-term outcomes. This score summarizes most of the clinical and treatment-related information that is required for treatment resistance quantification and that was collected during our naturalistic trial. It was, therefore, considered as a gold standard against which typical remission-based categorical outcomes were compared. Comparison between these two measures of resistance focused on three targets: 1) which were their clinical and treatment-related predictors, 2) how well the two measures predicted depression improvement to remission, and 3) how the two measures were related with each other.

The prevalence of TRD using our remission-based categorical definition was 37.5% (100 TRD patients out of 267 in total). This compares to around 45% of patients cumulatively failing to achieve remission at the second step of the STAR*D study, in which treatment options were more limited, available only after the first step and sequenced in a structured way (20). Using their response-based definition, the GSRD recorded a TRD prevalence of 50.7% and 40.6% in the TRD-I (47) and TRD-III (45, 48, 49) cross-sectional studies, respectively, and 54.5% in the TRD-II (50) prospective study. In summary, discrepances across cohorts may be attributed to different recruitment strategies, settings, treatment procedures followed and TRD definitions employed.

EW-MSM and categorical outcomes were predicted by shared and distinct sets of clinicodemographic variables in multivariate models. Baseline depressive severity and lifetime OCD comorbidity emerged as significant predictors for both EW-MSM and categorical models, while current psychotic features were associated with EW-MSM outcomes only, and family history of MDD in first-degree relatives was significant only in the categorical framework. Baseline depression severity was more strongly associated with EW-MSM than with categorical outcomes, since it is reflected by its severity dimension. Baseline depression severity also emerged as a key predictor of resistance in STAR*D findings, showing that higher initial symptom scores were associated with reduced likelihood of remission at the second step (51), and in GSRD results (25), that similarly identified severity as a robust correlate of resistance in TRD-I (47), TRD-II (52) and TRD-III (53) studies. Of note, mean R-MADRS was 40.6 in our tertiary-care study, with >80% of remitters being hospitalized during their current MDE, while in TRD-II and TRD-III studies mean R-MADRS was 31.5 and 34.1, respectively (48, 50). Lifetime OCD comorbidity (in 9.4% of remitters in our study) was also identified as predictor of resistance at the STAR*D second step (51); in the TRD-III study, the prevalence of comorbid OCD was very low (1.65%) and, although linked with higher suicidal risk and polypharmacy, it was not associated with resistance (54). Current psychotic features (in 19.3% of remitters), a predictor embedded within the MSM severity dimension, were consequently strongly linked with EW-MSM scores in our cohort, in agreement with the TRD-III study (53), where prevalence of psychotic features was 10.9% (48). The association between family history of MDD (in 28.4% of remitters) and categorical outcomes corroborates similar findings in the TRD-II study (52) pointing to the heritable vulnerability of TRD (55). GSRD studies have identified additional clinical predictors of resistance, including number of hospitalizations, suicidality, recurrent episodes, early age of onset, comorbid anxiety disorders and anhedonia/melancholic features (47, 48, 52, 53, 5658). These factors were not replicated in our study, which may be resulting from our remission-based definition as opposed to response-based frameworks, as well as the single tertiary-care center design and limited power of our sample.

In our study, remitters received a median of 2 ADs per episode, rising to 4 among TRD remitters. Nearly half (49.4%) received AD combinations and 83.3% underwent augmentation during their current MDE; respective percentages in TRD remitters were 84.8% and 92.4%. These rates indicate substantial treatment intensity and are broadly consistent with findings from the GSRD studies. In TRD-I, 18.7% of TRD patients had three and 9.2% four ADs during the index episode (47). In TRD-III, 60.6% received combination or augmentation regimens (45), with 29.5% on AD combinations and augmentation often involving antipsychotics (25.7%) or mood stabilizers (11.3%), while mean number of psychiatric drugs administered reached 2.18. Our cohort showed high pharmacological exposure, with mean cumulative fluoxetine equivalent doses of 89.9 mg across all trials versus 39.9 mg in TRD-III (45). This difference likely reflects the longer follow-up of our study subjects followed up to remission, demanding more AD trials and more intensive dose optimization. Treatment-related characteristics were unsurprisingly strongly associated with both EW-MSM and categorical outcomes. Longer episode duration predicted higher resistance in both EW-MSM and categorical models. Moreover, in multivariate models EW-MSM was associated with the total number of ADs received while categorical outcomes with the use of AD combinations. Additionally, EW-MSM was significantly predicted by the use of augmentation strategies, since these are included as a separate indicator of resistance. Overall, both clinicodemographic and treatment-related characteristics predicted EW-MSM more efficiently than categorical outcomes, underscoring its superior ability to reflect the multidimensional aspects of resistance (clinical burden, treatment intensity).

The hierarchical regression analysis demonstrated the superiority of EW-MSM over categorical outcomes in capturing symptom improvement from baseline to remission. This was mainly attributed, in a sensitivity analysis, to the fact that EW-MSM, through its severity dimension, was more strongly associated with baseline depression severity than categorical outcomes. Furthermore, EW-MSM can better reflect several other key aspects of resistance while categorical outcomes only rely on discrete AD trial counts. As a result, EW-MSM provides a more accurate and comprehensive index of resistance, which also captures therapeutic gain more efficiently than categorical outcomes.

Comparative analyses of EW-MSM dimensions across categorical outcomes showed a clear gradient of resistance, increasing from 1st to 2nd AD trial and TRD remitters (C1>B>A), only for the Antidepressants dimension, as number of ADs in EW-MSM is highly correlated with number of (monotherapy or combination) AD trials in categorical outcomes. Other EW-MSM dimensions were not equally discriminative. EW-MSM Duration clearly separated TRD remitters from early remitters but not among the two early remitters subgroups. EW-MSM Severity performed poorly and only when examined categorically while EW-MSM Augmentation clearly separated only among the most distant categories. These findings suggest that typical categorical outcomes are clearly separated among them only by the number of ADs received, based on which they are defined, but less clearly by several other key aspects of resistance, such as clinical severity, episode duration, and augmentation strategies. Yet, EW-MSM total score was strongly though not perfectly correlated with categorical outcomes and performed as well as the Antidepressants dimension, clearly distinguishing between all three categories on the same gradient, maybe because this dimension contributes up to 5 points to the total score.

The ROC curve analyses demonstrated that EW-MSM discriminated effectively (AUC≥0.80) only between the most distant categories (1st AD trial remitters vs. TRD remitters; AUC = 0.91) but not among adjacent categories. The suboptimal discriminative efficiency of EW-MSM between adjacent categories, especially among the first two, arises from the fact that our sample included patients with high severity intensely treated early on. Even in 1st AD trial remitters, 20% received AD combination and 75% received augmentation. Most remitters (76.8%) had moderate resistance by MSM cut-offs, making it more difficult to distinguish subgroups in them. As noted above, number of ADs tightly aligned with outcome categories, but other EW-MSM dimensions showed higher dispersion across them, resulting in poor discrimination between adjacent categories in ROC analyses. As has been proposed, MSM could become more granular and increase its discriminative efficiency by incorporating the number of augmentation strategies and AD combinations used (13).

Comparisons in ROC analyses were also extended to include combined groups. As pharmacological trials in TRD patients are sparse (15, 59), treatment guidance often relies on trials in patients failing to remit after at least one adequate AD trial (60). Psychiatric pharmacogenetic studies also often focus on non-remitters after one AD trial (61). Therefore, comparing 1st AD trial remitters with all other groups is of clinical relevance; discrimination among them was fair in our study (AUC = 0.815). When 1st and 2nd AD trial remitters were combined and contrasted with TRD remitters, the discriminative efficiency was even higher (AUC = 0.865), suggesting that TRD remitters are quite distinct from early remitters and lending validity to the TRD definition. Discriminative cut-offs identified in ROC analyses included EW-MSM scores of 8 and 9 or higher. The former is useful in distinguishing 1st AD trial remitters from all other groups combined or TRD remitters; the latter in distinguishing between TRD and early remitters. Both cut-offs lie in the moderate resistance range of scores (27).

Overall, findings from the comparisons between EW-MSM and categorical outcomes on our three aims are summarized as follows: 1) EW-MSM score was significantly associated with both clinicodemographic and treatment-related predictors, overlapping with, but also extending beyond those associated with categorical outcomes. Both approaches converged on baseline severity, current MDE duration, lifetime OCD comorbidity and number of ADs or AD combination as significant predictors in multivariate models, while EW-MSM was additionally predicted by current psychotic features and use of augmentation strategies. However, predictions were always more efficient for EW-MSM than for categorical outcomes, demonstrating its superior ability to reflect the clinical burden and treatment intensity aspects of resistance. 2) EW-MSM was more strongly associated with baseline severity than categorical outcomes and consequently predicted depression improvement to remission more efficiently. 3) EW-MSM was strongly though not perfectly associated with categorical outcomes. Only the Antidepressants dimension and the total score showed a clear gradient of resistance across all three outcome categories. As a result, EW-MSM effectively discriminated between the most distant categories (1st AD trial remitters vs. TRD remitters) but not among adjacent categories. This does not necessarily point to an EW-MSM drawback; categorical outcomes defined by number of AD trial failures were not as distinct regarding other key aspects of resistance captured by EW-MSM and were therefore not adequately distinguishable by it.

This study has notable strengths. To our knowledge, this is the first application of MSM across the entire episode, whereas previous research had employed it at random snapshots, such as at hospital discharge (2832). We derived a continuous score of episode-wide resistance, capturing both clinical burden (symptom severity, episode duration) and treatment intensity (AD trials, AD combination and augmentation). This was used as a reference against which remission-based categorical classifications were compared. The longitudinal, naturalistic design of the study mirrors real-world clinical challenges related to following up TRD patients to remission. We prioritized clinical remission over symptom response, aligning with contemporary treatment objectives emphasizing remission and functional recovery beyond remission, but recognizing that among them clinical remission is the most feasible and finally the preferred endpoint to define a successful clinical outcome (22).

Several limitations should also be acknowledged. First, the cohort did not include patients receiving ECT or newer interventions indicated in TRD, such as esketamine, which may limit generalizability to all therapeutic strategies. Moreover, unavailability of ECT did not allow scoring of the full EW-MSM tool in this cohort. Total scores might be higher, particularly for TRD patients, if ECT was available. Second, MSM does not capture non-pharmacological interventions, like psychotherapy or rTMS; consequently, patients receiving psychotherapy were excluded from our study to maintain consistency in applying MSM, though at the cost of external validity. Third, no objective measures of adherence, such as AD plasma levels, were used; suboptimal adherence may lead to pseudo-resistance (12). Fourth, the timing of episode onset and baseline depression severity were retrospectively determined and, therefore, potentially subject to recall bias. Finally, this study was conducted at a single tertiary-care center and included intensely treated patients with high severity, introducing selection bias; therefore, replication in other clinical settings is needed to ensure the findings are representative of broader clinical populations.

Our findings suggest several directions for future clinical and translational research. Following up MDD (incl. TRD) patients to remission in clinical research settings and using EW-MSM to measure treatment resistance could help identify data-driven cut-offs of resistance through latent class/profile or cluster analyses of EW-MSM dimensions, possibly combined with patient-reported outcomes. These cut-offs could prove of greater clinical utility than typical categorical classifications and be used for drug discovery or stratification in clinical trials. Furthermore, EW-MSM total score or specific dimensions might be tested, in an unbiased and hypothesis-free way (i.e. without relying on arbitrary cut-offs based on AD trial failures), for correlation with putative biological (e.g. inflammatory or oxidative stress biomarkers, neuroimaging data, polygenic scores, etc.), clinical (e.g. psychiatric or medical comorbidity) or psychological (e.g. personality profile, cognition, childhood trauma, coping strategies, etc.) predictors. Timing should be considered in these analyses, facilitating the identification of stage-specific biomarkers for use in precision psychiatry frameworks. In such pragmatic studies follow-up can be long to accurately reflect routine care but could be justified by the expected benefits. Future studies should evaluate EW-MSM scoring in larger and more diverse populations to confirm applicability and strengthen generalizability. Finally, feasibility could be enhanced by applying the EW-MSM approach in electronic health records databases or patient registries.

In conclusion, our study highlighted the clinical importance of prioritizing remission over response as the therapeutic target and for defining resistance, showing that remission is feasible even among patients with TRD, yet after more treatment attempts. EW-MSM produced a continuous score of episode-wide resistance, capturing both clinical burden (symptom severity, episode duration) and treatment intensity, which was used as a reference against which remission-based categorical outcomes were compared. EW-MSM and categorical outcomes were predicted by shared but also distinct sets of clinicodemographic and treatment-related variables. However, EW-MSM reflected clinical burden, incl. baseline depression severity, and treatment intensity aspects of resistance more strongly than categorical outcomes and consequently predicted depression improvement to remission more efficiently. EW-MSM scores followed a clear gradient of resistance across categorical outcomes and effectively discriminated between the most distant categories but not among adjacent ones. Overall, our results demonstrate the added value of using EW-MSM and support its use in clinical and translational research to identify the clinical, biological and psychological correlates of resistance.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

The studies involving humans were approved by Research Ethics Committee of Attikon University General Hospital. The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.

Author contributions

AA: Formal Analysis, Data curation, Writing – original draft, Investigation. SP: Writing – review & editing. DD: Writing – review & editing. NS: Writing – review & editing. JM: Writing – review & editing. AS: Writing – review & editing. AC: Writing – review & editing. PF: Supervision, Project administration, Formal Analysis, Funding acquisition, Writing – original draft, Methodology, Software, Investigation, Visualization, Resources, Conceptualization, Data curation.

Funding

The author(s) declared that financial support was received for this work and/or its publication. Funding for open access publishing was partly provided by the National and Kapodistrian University of Athens, Faculty of Medicine.

Conflict of interest

The author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

The authors PF, NS declared that they were an editorial board member of Frontiers, at the time of submission. This had no impact on the peer review process and the final decision.

Generative AI statement

The author(s) declared that generative AI was not used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpsyt.2026.1769246/full#supplementary-material

References

1. James SL, GBD 2017 Disease and Injury Incidence and Prevalence Collaborators, Vos T, and Murray CJL. Global, regional, and national incidence, prevalence, and years lived with disability for 354 diseases and injuries for 195 countries and territories, 1990–2017: a systematic analysis for the Global Burden of Disease Study 2017. Lancet. (2018) 392:1789–858. doi: 10.1016/S0140-6736(18)32279-7

PubMed Abstract | Crossref Full Text | Google Scholar

2. McIntyre RS, Alsuwaidan M, Baune BT, Berk M, Demyttenaere K, Goldberg JF, et al. Treatment-resistant depression: definition, prevalence, detection, management, and investigational interventions. World Psychiatry. (2023) 22:394–412. doi: 10.1002/wps.21120

PubMed Abstract | Crossref Full Text | Google Scholar

3. Pappa S, Shah M, Young S, Anwar T, and Ming T. Care pathways, prescribing practices and treatment outcomes in major depressive disorder and treatment-resistant depression: retrospective, population-based cohort study. BJPsych Open. (2024) 10:e32. doi: 10.1192/bjo.2023.627

PubMed Abstract | Crossref Full Text | Google Scholar

4. Adekkanattu P, Olfson M, Susser LC, Patra B, Vekaria V, Coombes BJ, et al. Comorbidity and healthcare utilization in patients with treatment resistant depression: A large-scale retrospective cohort analysis using electronic health records. J Affect Disord. (2023) 324:102–13. doi: 10.1016/j.jad.2022.12.044

PubMed Abstract | Crossref Full Text | Google Scholar

5. Lundberg J, Cars T, Lööv S-Å, Söderling J, Sundström J, Tiihonen J, et al. Association of treatment-resistant depression with patient outcomes and health care resource utilization in a population-wide study. JAMA Psychiatry. (2023) 80:167–75. doi: 10.1001/jamapsychiatry.2022.3860

PubMed Abstract | Crossref Full Text | Google Scholar

6. Heerlein K, Young AH, Otte C, Frodl T, Degraeve G, Hagedoorn W, et al. Real-world evidence from a European cohort study of patients with treatment resistant depression: Baseline patient characteristics. J Affect Disord. (2021) 283:115–22. doi: 10.1016/j.jad.2020.11.124

PubMed Abstract | Crossref Full Text | Google Scholar

7. Rathod S, Denee T, Eva J, Kerr C, Jacobsen N, Desai M, et al. Health-related quality of life burden associated with treatment-resistant depression in UK patients: Quantitative results from a mixed-methods non-interventional study. J Affect Disord. (2022) 300:551–62. doi: 10.1016/j.jad.2021.12.090

PubMed Abstract | Crossref Full Text | Google Scholar

8. Zhdanava M, Pilon D, Ghelerter I, Chow W, Joshi K, Lefebvre P, et al. The prevalence and national burden of treatment-resistant depression and major depressive disorder in the United States. J Clin Psychiatry. (2021) 82:20m13699. doi: 10.4088/JCP.20m13699

PubMed Abstract | Crossref Full Text | Google Scholar

9. Gronemann FH, Petersen J, Alulis S, Jensen KJ, Riise J, Ankarfeldt MZ, et al. Treatment patterns in patients with treatment-resistant depression in Danish patients with major depressive disorder. J Affect Disord. (2021) 287:204–13. doi: 10.1016/j.jad.2021.03.029

PubMed Abstract | Crossref Full Text | Google Scholar

10. Bergfeld IO, Mantione M, Figee M, Schuurman PR, Lok A, and Denys D. Treatment-resistant depression and suicidality. J Affect Disord. (2018) 235:362–7. doi: 10.1016/j.jad.2018.04.016

PubMed Abstract | Crossref Full Text | Google Scholar

11. Pappa S, Shah M, Young S, Anwar T, and Ming T. Health-care utilisation and associated economic burden in major depressive disorder and treatment-resistant depression in a longitudinal population-based cohort study of 110.000 patients. J Psychiatr Res. (2024) 180:511–8. doi: 10.1016/j.jpsychires.2024.11.022

PubMed Abstract | Crossref Full Text | Google Scholar

12. Berlim MT and Turecki G. What is the meaning of treatment resistant/refractory major depression (TRD)? A systematic review of current randomized trials. Eur Neuropsychopharmacol. (2007) 17:696–707. doi: 10.1016/j.euroneuro.2007.03.009

PubMed Abstract | Crossref Full Text | Google Scholar

13. Ruhé HG, van Rooijen G, Spijker J, Peeters FP, and Schene AH. Staging methods for treatment resistant depression. A systematic review. J Affect Disord. (2012) 137:35–45. doi: 10.1016/j.jad.2011.02.020

PubMed Abstract | Crossref Full Text | Google Scholar

14. Sforzini L, Worrell C, Kose M, Anderson IM, Aouizerate B, Arolt V, et al. A Delphi-method-based consensus guideline for definition of treatment-resistant depression for clinical trials. Mol Psychiatry. (2022) 27:1286–99. doi: 10.1038/s41380-021-01381-x

PubMed Abstract | Crossref Full Text | Google Scholar

15. Fiorillo A, Demyttenaere K, Martiadis V, and Martinotti G. Editorial: Treatment resistant depression (TRD): epidemiology, clinic, burden and treatment. Front Psychiatry. (2025) 16:1588902. doi: 10.3389/fpsyt.2025.1588902

PubMed Abstract | Crossref Full Text | Google Scholar

16. Demyttenaere K and Van Duppen Z. The impact of (the concept of) treatment-resistant depression: an opinion review. Int J Neuropsychopharmacol. (2019) 22:85–92. doi: 10.1093/ijnp/pyy052

PubMed Abstract | Crossref Full Text | Google Scholar

17. European Medicines Agency. Guideline on clinical investigation of medicinal products in the treatment of depression. London, UK: Committee for Medicinal Products for Human Use (CHMP (2013). Revision 2.

Google Scholar

18. Möller H-J. Outcomes in major depressive disorder: The evolving concept of remission and its implications for treatment. World J Biol Psychiatry. (2008) 9:102–14. doi: 10.1080/15622970801981606

PubMed Abstract | Crossref Full Text | Google Scholar

19. Rush AJ, Fava M, Wisniewski SR, Lavori PW, Trivedi MH, Sackeim HA, et al. Sequenced treatment alternatives to relieve depression (STAR*D): rationale and design. Control Clin Trials. (2004) 25:119–42. doi: 10.1016/s0197-2456(03)00112-0

PubMed Abstract | Crossref Full Text | Google Scholar

20. Rush AJ, Trivedi MH, Wisniewski SR, Nierenberg AA, Stewart JW, Warden D, et al. Acute and longer-term outcomes in depressed outpatients requiring one or several treatment steps: a STAR*D report. Am J Psychiatry. (2006) 163:1905–17. doi: 10.1176/ajp.2006.163.11.1905

PubMed Abstract | Crossref Full Text | Google Scholar

21. McIntyre RS, Lee Y, and Mansur RB. Treating to target in major depressive disorder: response to remission to functional recovery. CNS Spectr. (2015) 20:20–30. doi: 10.1017/s1092852915000826

PubMed Abstract | Crossref Full Text | Google Scholar

22. Fekadu A, Donocik JG, and Cleare AJ. Standardisation framework for the Maudsley staging method for treatment resistance in depression. BMC Psychiatry. (2018) 18:100. doi: 10.1186/s12888-018-1679-x

PubMed Abstract | Crossref Full Text | Google Scholar

23. Salloum NC and Papakostas GI. Staging treatment intensity and defining resistant depression: historical overview and future directions. J Clin Psychiatry. (2019) 80:18r12250. doi: 10.4088/JCP.18r12250

PubMed Abstract | Crossref Full Text | Google Scholar

24. Thase ME and Rush AJ. When at first you don’t succeed: sequential strategies for antidepressant nonresponders. J Clin Psychiatry. (1997) 58:23–9.

PubMed Abstract | Google Scholar

25. Bartova L, Dold M, Kautzky A, Fabbri C, Spies M, Serretti A, et al. Results of the European Group for the Study of Resistant Depression (GSRD) - basis for further research and clinical practice. World J Biol Psychiatry. (2019) 20:427–48. doi: 10.1080/15622975.2019.1635270

PubMed Abstract | Crossref Full Text | Google Scholar

26. Souery D, Amsterdam J, de Montigny C, Lecrubier Y, Montgomery S, Lipp O, et al. Treatment resistant depression: methodological overview and operational criteria. Eur Neuropsychopharmacol. (1999) 9:83–91. doi: 10.1016/s0924-977x(98)00004-2

PubMed Abstract | Crossref Full Text | Google Scholar

27. Fekadu A, Wooderson S, Donaldson C, Markopoulou K, Masterson B, Poon L, et al. A multidimensional tool to quantify treatment resistance in depression: the Maudsley staging method. J Clin Psychiatry. (2009) 70:177–84. doi: 10.4088/jcp.08m04309

PubMed Abstract | Crossref Full Text | Google Scholar

28. Fekadu A, Wooderson SC, Markopoulou K, and Cleare AJ. The Maudsley Staging Method for treatment-resistant depression: prediction of longer-term outcome and persistence of symptoms. J Clin Psychiatry. (2009) 70:952–7. doi: 10.4088/JCP.08m04728

PubMed Abstract | Crossref Full Text | Google Scholar

29. Van Belkum SM, Geugies HH, Lysen TS, Cleare AJ, Peeters FP, Penninx BW, et al. Validity of the Maudsley Staging Method in predicting treatment-resistant depression outcome using the Netherlands study of depression and anxiety. J Clin Psychiatry. (2018) 79:22060. doi: 10.4088/JCP.17m11475

PubMed Abstract | Crossref Full Text | Google Scholar

30. Icick R, Millet É, Curis E, Bellivier F, and Lépine JP. Predictive value of baseline resistance in early response to antidepressants. J Affect Disord. (2014) 161:127–35. doi: 10.1016/j.jad.2014.03.002

PubMed Abstract | Crossref Full Text | Google Scholar

31. Fekadu A, Rane LJ, Wooderson SC, Markopoulou K, Poon L, and Cleare AJ. Prediction of longer-term outcome of treatment-resistant depression in tertiary care. Br J Psychiatry. (2012) 201:369–75. doi: 10.1192/bjp.bp.111.102665

PubMed Abstract | Crossref Full Text | Google Scholar

32. van Diermen L, Hebbrecht K, Schrijvers D, Sabbe BCG, Fransen E, and Birkenhäger TK. The Maudsley Staging Method as predictor of electroconvulsive therapy effectiveness in depression. Acta Psychiatr Scand. (2018) 138:605–14. doi: 10.1111/acps.12962

PubMed Abstract | Crossref Full Text | Google Scholar

33. Ferentinos P, Fountoulakis KN, Lewis CM, Porichi E, Dikeos D, Papageorgiou C, et al. Validating a two-dimensional bipolar spectrum model integrating DSM-5’s mixed features specifier for Major Depressive Disorder. Compr Psychiatry. (2017) 77:89–99. doi: 10.1016/j.comppsych.2017.06.007

PubMed Abstract | Crossref Full Text | Google Scholar

34. Mullins N, Forstner AJ, O’Connell KS, Coombes B, Coleman JRI, Qiao Z, et al. Genome-wide association study of more than 40,000 bipolar disorder cases provides new insights into the underlying biology. Nat Genet. (2021) 53:817–29. doi: 10.1038/s41588-021-00857-4

PubMed Abstract | Crossref Full Text | Google Scholar

35. First MB, Williams JB, Karg RS, and Spitzer RL. Structured clinical interview for DSM-5 disorders: Clinician Version (SCID-5-CV). Arlington, VA: American Psychiatric Association (2016).

Google Scholar

36. Montgomery SA and Åsberg M. A new depression scale designed to be sensitive to change. Br J Psychiatry. (1979) 134:382–9. doi: 10.1192/bjp.134.4.382

PubMed Abstract | Crossref Full Text | Google Scholar

37. Snaith RP, Harrop FM, Newby DA, and Teale C. Grade scores of the montgomery-asberg depression and the clinical anxiety scales. Br J Psychiatry. (1986) 148:599–601. doi: 10.1192/bjp.148.5.599

PubMed Abstract | Crossref Full Text | Google Scholar

38. Hayasaka Y, Purgato M, Magni LR, Ogawa Y, Takeshima N, Cipriani A, et al. Dose equivalents of antidepressants: Evidence-based recommendations from randomized controlled trials. J Affect Disord. (2015) 180:179–84. doi: 10.1016/j.jad.2015.03.021

PubMed Abstract | Crossref Full Text | Google Scholar

39. WHO Collaborating Centre for Drug Statistics Methodology. ATC (Anatomical Therapeutic Chemical)/DDD (Defined Daily Dose) Index 2025 (2025). Available online at: https://atcddd.fhi.no/atc_ddd_index/ (Accessed November 14, 2025).

Google Scholar

40. Young RC, Biggs JT, Ziegler VE, and Meyer DA. A rating scale for mania: reliability, validity and sensitivity. Br J Psychiatry. (1978) 133:429–35. doi: 10.1192/bjp.133.5.429

PubMed Abstract | Crossref Full Text | Google Scholar

41. Guy W. ECDEU assessment manual for psychopharmacology. Rockville, MD: US Department of Health, Education, and Welfare (1976).

Google Scholar

42. Hawley CJ, Gale TM, and Sivakumaran T. Defining remission by cut off score on the MADRS: selecting the optimal value. J Affect Disord. (2002) 72:177–84. doi: 10.1016/s0165-0327(01)00451-7

PubMed Abstract | Crossref Full Text | Google Scholar

43. Samara MT, Levine SZ, and Leucht S. Linkage of young mania rating scale to clinical global impression scale to enhance utility in clinical practice and research trials. Pharmacopsychiatry. (2023) 56:18–24. doi: 10.1055/a-1841-6672

PubMed Abstract | Crossref Full Text | Google Scholar

44. StataCorp. Stata Statistical Software: Release 14. College Station, TX: StataCorp LP (2015).

Google Scholar

45. Dold M, Bartova L, Mendlewicz J, Souery D, Serretti A, Porcelli S, et al. Clinical correlates of augmentation/combination treatment strategies in major depressive disorder. Acta Psychiatr Scand. (2018) 137:401–12. doi: 10.1111/acps.12870

PubMed Abstract | Crossref Full Text | Google Scholar

46. Serretti A, Kasper S, Bartova L, Zohar J, Souery D, Montgomery S, et al. Clinical predictors of treatment resistant depression. Eur Neuropsychopharmacol. (2025) 98:26–34. doi: 10.1016/j.euroneuro.2025.06.011

PubMed Abstract | Crossref Full Text | Google Scholar

47. Souery D, Oswald P, Massat I, Bailer U, Bollen J, Demyttenaere K, et al. Clinical factors associated with treatment resistance in major depressive disorder: results from a European multicenter study. J Clin Psychiatry. (2007) 68:1062–70. doi: 10.4088/jcp.v68n0713

PubMed Abstract | Crossref Full Text | Google Scholar

48. Dold M, Bartova L, Fugger G, Kautzky A, Mitschek MMM, Fabbri C, et al. Melancholic features in major depression - a European multicenter study. Prog Neuropsychopharmacol Biol Psychiatry. (2021) 110:110285. doi: 10.1016/j.pnpbp.2021.110285

PubMed Abstract | Crossref Full Text | Google Scholar

49. Fugger G, Bartova L, Fabbri C, Fanelli G, Dold M, Swoboda MMM, et al. The sociodemographic and clinical profile of patients with major depressive disorder receiving SSRIs as first-line antidepressant treatment in European countries. Eur Arch Psychiatry Clin Neurosci. (2022) 272:715–27. doi: 10.1007/s00406-021-01368-3

PubMed Abstract | Crossref Full Text | Google Scholar

50. Souery D, Calati R, Papageorgiou K, Juven-Wetzler A, Gailledreau J, Modavi D, et al. What to expect from a third step in treatment resistant depression: A prospective open study on escitalopram. World J Biol Psychiatry. (2015) 16:472–82. doi: 10.3109/15622975.2014.987814

PubMed Abstract | Crossref Full Text | Google Scholar

51. Rush AJ, Wisniewski SR, Warden D, Luther JF, Davis LL, Fava M, et al. Selecting among second-step antidepressant medication monotherapies: predictive value of clinical, demographic, or first-step treatment features. Arch Gen Psychiatry. (2008) 65:870–80. doi: 10.1001/archpsyc.65.8.870

PubMed Abstract | Crossref Full Text | Google Scholar

52. Balestri M, Calati R, Souery D, Kautzky A, Kasper S, Montgomery S, et al. Socio-demographic and clinical predictors of treatment resistant depression: A prospective European multicenter study. J Affect Disord. (2016) 189:224–32. doi: 10.1016/j.jad.2015.09.033

PubMed Abstract | Crossref Full Text | Google Scholar

53. Kautzky A, Dold M, Bartova L, Spies M, Kranz GS, Souery D, et al. Clinical factors predicting treatment resistant depression: affirmative results from the European multicenter study. Acta Psychiatr Scand. (2019) 139:78–88. doi: 10.1111/acps.12959

PubMed Abstract | Crossref Full Text | Google Scholar

54. Dold M, Bartova L, Souery D, Mendlewicz J, Porcelli S, Serretti A, et al. Low comorbid obsessive-compulsive disorder in patients with major depressive disorder - Findings from a European multicenter study. J Affect Disord. (2018) 227:254–9. doi: 10.1016/j.jad.2017.10.033

PubMed Abstract | Crossref Full Text | Google Scholar

55. Fabbri C, Corponi F, Souery D, Kasper S, Montgomery S, Zohar J, et al. The genetics of treatment-resistant depression: A critical review and future perspectives. Int J Neuropsychopharmacol. (2019) 22:93–104. doi: 10.1093/ijnp/pyy024

PubMed Abstract | Crossref Full Text | Google Scholar

56. Zaninotto L, Souery D, Calati R, Sentissi O, Kasper S, Akimova E, et al. Treatment resistance in severe unipolar depression: no association with psychotic or melancholic features. Ann Clin Psychiatry. (2013) 25:97–106. doi: 10.1177/104012371302500204

Crossref Full Text | Google Scholar

57. Dold M, Bartova L, Fugger G, Kautzky A, Souery D, Mendlewicz J, et al. Major depression and the degree of suicidality: results of the european group for the study of resistant depression (GSRD). Int J Neuropsychopharmacol. (2018) 21:539–49. doi: 10.1093/ijnp/pyy009

PubMed Abstract | Crossref Full Text | Google Scholar

58. Luca A, Luca M, Kasper S, Pecorino B, Zohar J, Souery D, et al. Anhedonia is associated with a specific depression profile and poor antidepressant response. Int J Neuropsychopharmacol. (2024) 27. doi: 10.1093/ijnp/pyae055

PubMed Abstract | Crossref Full Text | Google Scholar

59. Strawbridge R, Carter B, Marwood L, Bandelow B, Tsapekos D, Nikolova VL, et al. Augmentation therapies for treatment-resistant depression: systematic review and meta-analysis. Br J Psychiatry. (2019) 214:42–51. doi: 10.1192/bjp.2018.233

PubMed Abstract | Crossref Full Text | Google Scholar

60. Scott F, Hampsey E, Gnanapragasam S, Carter B, Marwood L, Taylor RW, et al. Systematic review and meta-analysis of augmentation and combination treatments for early-stage treatment-resistant depression. J Psychopharmacol. (2023) 37:268–78. doi: 10.1177/02698811221104058

PubMed Abstract | Crossref Full Text | Google Scholar

61. Fanelli G, Domschke K, Minelli A, Gennarelli M, Martini P, Bortolomasi M, et al. A meta-analysis of polygenic risk scores for mood disorders, neuroticism, and schizophrenia in antidepressant response. Eur Neuropsychopharmacol. (2022) 55:86–95. doi: 10.1016/j.euroneuro.2021.11.005

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: depression, major depressive disorder (MDD), Maudsley staging method (MSM), remission, treatment-resistant depression (TRD)

Citation: Antoniou A, Pappa S, Dikeos D, Smyrnis N, Mendlevicz J, Serretti A, Cleare A and Ferentinos P (2026) Episode-wide Maudsley staging in treatment-resistant depression: a longitudinal tertiary-care comparison of dimensional and categorical outcomes. Front. Psychiatry 17:1769246. doi: 10.3389/fpsyt.2026.1769246

Received: 16 December 2025; Accepted: 27 January 2026; Revised: 26 January 2026;
Published: 11 February 2026.

Edited by:

Vassilis Martiadis, Asl Napoli 1 Centro, Italy

Reviewed by:

Fabiola Raffone, Asl Napoli 1 Centro, Italy
Mason T. Breitzig, Henry Ford Health System, United States

Copyright © 2026 Antoniou, Pappa, Dikeos, Smyrnis, Mendlevicz, Serretti, Cleare and Ferentinos. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Panagiotis Ferentinos, cGZlcmVudGlub3NAbWVkLnVvYS5ncg==

ORCID: Panagiotis Ferentinos, orcid.org/0000-0001-8531-5623

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.