Recommendations for selection and adaptation of rating scales for clinical studies of rapid-acting antidepressants

Yavorsky, Christian; Ballard, Elizabeth; Opler, Mark; Sedway, Jan; Targum, Steven D.; Lenderking, William

doi:10.3389/fpsyt.2023.1135828

REVIEW article

Front. Psychiatry, 02 June 2023

Sec. Psychopharmacology

Volume 14 - 2023 | https://doi.org/10.3389/fpsyt.2023.1135828

This article is part of the Research TopicPerspectives on New Fast-acting AntidepressantsView all 5 articles

Recommendations for selection and adaptation of rating scales for clinical studies of rapid-acting antidepressants

Christian Yavorsky¹^*

Elizabeth Ballard²

Mark Opler³

Jan Sedway³

Steven D. Targum⁴

William Lenderking⁵ for ISCTM Assessment Methods and Endpoints for Rapid-Acting Antidepressants-RAADs Working Group

¹Valis Biosciences, Inc., Berkeley, CA, United States
²NIH/NIMH, Bethesda, MD, United States
³WIRB Copernicus Group (WCG) Clinical Endpoint Solutions, Princeton, NJ, United States
⁴Signant Health, Blue Bell, PA, United States
⁵Evidera, Bethesda, MD, United States

The novel mechanisms of action (MOA) derived from some recently introduced molecular targets have led to regulatory approvals for rapid acting antidepressants (RAADs) that can generate responses within hours or days, rather than weeks or months. These novel targets include the N-methyl-D-glutamate receptor antagonist ketamine, along with its enantiomers and various derivatives, and the allosteric modulators of gamma-aminobutyric acid (GABA) receptors. There has also been a strong resurgence in interest in psychedelic compounds that impact a range of receptor sites including D1, 5-HT7, KOR, 5-HT5A, Sigma-1, NMDA, and BDNF. The RAADs developed from these novel targets have enabled successful treatment for difficult to treat depressed individuals and has generated a new wave of innovation in research and treatment. Despite the advances in the neurobiology and clinical treatment of mood disorders, we are still using rating instruments that were created decades ago for drugs from a different era (e.g., The Hamilton and Montgomery-Åsberg depression rating scales, HDRS, and MADRS) continue to be used. These rating instruments were designed to assess mood symptoms over a 7-day time frame. Consequently, the use of these rating instruments often requires modifications to address items that cannot be assessed in short time frames, such as the sleep and appetite items. This review describes the adaptative approaches that have been made with the existing scales to meet this need and examines additional domains such as daily activities, side effects, suicidal ideation and behavior, and role functioning. Recommendations for future studies are described, including the challenges related to implementation of these adapted measures and approaches to mitigation.

Introduction

Until 2019, approved antidepressant treatments targeted the biogenic amines and were based on assumed mechanisms of action (MOA) that had been the mainstay of drug development and the psychiatric treatment of mood disorders for decades. However, in March 2019 the regulatory approval of two new treatments (esketamine approved for treatment resistant depression and brexanolone approved for post-partum depression) moved the field beyond the MOA of prior decades (1). With new synaptic targets, such as the N-methyl-D-glutamate receptor antagonist (NMDA) and gamma-aminobutyric acid (GABA), the core assumptions about what antidepressants are and how they work have been challenged and the course of drug development in psychiatry has been altered.

One important distinction about these new treatments and conventional antidepressants is the speed with which they begin to ameliorate depressive symptoms. Standard antidepressant treatments, acting on dopamine, norepinephrine, and/or serotonin take several weeks to work, and the course of treatment may take many months. Additionally, 30–40% of depressed individuals are resistant to conventional SSRI/SNRI treatments and results from large population studies suggest that 65% of treated individuals will relapse within 3 months (2, 3). Alternatively, the rapid acting antidepressants (RAADs) have the potential to resolve symptoms in a much shorter timeframe. For instance, Opler et al. (4) described rapid positive responses to administration of 40 mg of intranasal ketamine under supervision after only 15 min. The pattern of symptom response may differ somewhat from the responses to the traditional, more conventional treatments. In addition to the reduction in sadness and anxiety, investigators have noted an ultrarapid transition from pessimism to a markedly improved outlook that includes the use of positive valence in language and the onset of what one treated individual described as “calm alertness.” This notion of “calm alertness” also appears to be an area for further development and discussion: are the changes seen in the symptom severities measured by these instruments the most salient for patients? Scales such as the Clinician Administered Dissociative States Scale (CADSS) (5), 5-Dimensional Altered States of Consciousness Rating Scale (5D-ASC) (4, 6, 7), Mystical Experience Questionnaire (MEQ) (8, 9), and others measure aspects of subjective experience important to treatment effect are often included in clinical trials. That said, they are often not included in primary analysis and exist as secondary outcome measures to asses risk. The literature around treatment response to RAADs and psychedelics in terms of these types of scales suggest strong correlations between domains like insight, feelings of unity and empathogenic effects [e.g., (10–13)] and antidepressant treatment response. Notably, Roseman et al. (13) found correlations of “oceanic boundlessness” (r = 44) on the 5D-ASC and reduction of depression severity as measured by the self-reported QIDS-16 and was also predictive of treatment response (based on a clinical response defined as ≥50% in QIDS-16SR at 5 weeks compared to baseline). Griffiths et al. (14) also explored immediate and persistent positive experiences that overlap somewhat with typical depression rating scales but capture phenomenon such as the above mentioned “unity” as well as “deeply felt positive mood” and positive attitudes toward self, life, and others. While not a focus of the working group at his time, the question of whether the current scales are measuring patient experience in a thorough and representative manner are critical and merit further discussion.

Prior eras of clinical trials saw innovation in psychiatry spur a set of new approaches to evaluation and assessment of efficacy and safety. Drug treatments for depression led Dr. Max Hamilton to create a new, clinician-rated scale that was specific to patients “suffering from affective disorder of the depressive type” (15). Hamilton developed the Hamilton Depression rating scale (HDRS) to facilitate communication between physicians and not for the serial assessment of mood symptoms in clinical trials. Twenty years later, a profusion of clinical trials for new candidate drugs required a more sensitive, broadly applicable rating instrument that would be capable not only of differentiating drug from placebo but assessing entirely different classes of treatment as well. Consequently, the Montgomery-Åsberg depression rating scale (MADRS) was developed by Montgomery and Åsberg (16) in a validation study. These investigators created a 10-item rating instrument that selected the 10 symptoms that were most sensitive to change during a clinical trial of antidepressants that were available at that time. Subsequently, new measurement tools flourished, including patient reported outcomes, such as the Beck Depression Inventory (1961). The HDRS and MADRS, further operationalized by Williams' (17) and the SIGMA by Williams and Kobak (18) formalized the use of structured interview guides, the practice of euthymic baseline comparisons, and helped standardize the past week as the reference period for depression assessments in clinical research. The use of functional assessment tools such as the Sheehan Disability Scale (SDS) with its DISCAN metric (19, 20), and the advent of new safety measures to capture suicidality, including the C-SSRS (21), S-STS (22), and others further established the core set of assessments which appeared to be sufficient, although concerns about the “gold standard” have been raised periodically [e.g., (23, 24)]. It is important to note that none of these instruments were developed for the assessments of RAADs.

The experience of the “first generation” of RAAD development has yielded some important lessons, principally about the best practices in the use of these existing scales. Under the auspices of the International Society of CNS Clinical Trials Methodology (ISCTM), a working group co-chaired by four of the authors (CY, EB, MO, and JS) was convened to evaluate the approaches undertaken to date. In the following sections, we will discuss some of the efforts made to adapt the existing scales for RAAD studies, the changes that have been made in each, and the evidence, if any for the effectiveness of those adaptations. Additionally, we will outline the development of new scales and the possibilities that may be afforded by novel digital measurements.

Defining the “RAPID-acting” paradigm: when is “rapid” and what is “action?”

The ISCTM working-group noted that “rapid-acting” is a challenging term to define and that there has been a lack of consensus in the field on the most appropriate criteria. Broadly speaking, “rapid-acting” may be taken to mean any measurable, significant improvement that manifests in shorter timeframes than those known to be associated with standard treatments. Gould et al. (25) indicate that “ideally, a rapid-acting antidepressant drug would exert its effects within hours or days, as opposed to weeks or months.” Standard evaluations in antidepressant trials have typically been conducted at weekly in-clinic visits. Clinicians are asked to summarize a week's experience of symptoms into a narrow quantitative measure despite the fact that symptom severity may vary from day to day. Nonetheless, the one-week time frame has been largely effective for the evaluation of conventional antidepressants because the clinical trials lasted for 4–8 weeks. However, the experience of clinicians from the past 60 years would suggest that detectable, meaningful symptomatic changes may occur in <1 week and that the 1-week time frame may be insufficient for new drug candidates. The emergence of RAADs have convinced everyone that shorter measurement intervals are needed. As noted above, results from studies of ketamine, arguably the archetype of this class of treatments, have shown early responses that are clinically meaningful to affected individuals. These findings have built a case for evaluative methods that are closer to real-time.

The advent of RAADs raises some unresolved questions about the criteria for clinically meaningful response. It is fairly well-understood that many RAADs like ketamine may have an early impact only certain domains of depressive symptoms [e.g., (26)], that some symptoms cannot be immediately measured (sleep, appetite), and that the duration of effect can be a week or less [e.g., (27)]. Hence, there are open questions about the requisite number of symptoms and the length of symptom resolution needed to call a treatment an antidepressant:

1. How many depressive symptoms need to resolve in order to describe a response as a clinically meaningful antidepressant response?

2. How long do the resolution of symptoms need to last in order to describe a response as a clinically meaningful antidepressant response?

The first question is compounded by the reality that not all depressed individuals have the same depressive symptoms. The second question is compounded by the issue of dosing frequency.

Defining what constitutes a clinically “significant” effect for RAADs seems simpler in some respects than defining what is clinically meaningful because there are well-established standards for statistical and clinical significance that have been used for many years by methodologists and regulators to evaluate evidence from clinical trials. Often, the threshold for antidepressant response has been ≥50% decrease from the baseline MADRS or HDRS score [e.g., (28)] whenever that occurs.

An ongoing debate within our workgroup reflects the dynamic nature of this concept of response as it applies to RAADs, specifically: should change be taken to mean rapid alleviation of symptoms that reach or approach statistical significance or should it require rapid remittal of symptoms below some pre-defined threshold? A further challenge, as noted in question 2 above is the need to demonstrate not merely early improvement, but also sustained relief. Is rapid, transient improvement acceptable or useful, given the risks associated with starting any new treatment with a patient who suffers with a mood disorder? Furthermore, the heterogeneity of symptom resolution also requires consideration; for example, improvements in subjective mood without functional improvements or reduction in suicidal ideation would generally not be considered adequate by most stakeholders. As these topics are still open for debate, we must be precise about the timeframe of assessment, the maintenance of response, and the domain under study in any discussion about the merits or limitations of RAADs.

In the conduct of this review, we have primarily focused on documenting the methods used for evaluation, the approaches taken to date in modifying standard assessments, and the evidence for their effectiveness. This work focuses on how investigators have chosen to alter rating scales for use in rapid-acting antidepressant studies, as well as the evidence for continued reliability and validity. We have included several of the major clinician-rated measures routinely used as primary efficacy endpoints, as well as the most widely utilized tools to evaluate suicidal ideation and behavior, functional status, quality of life, and side effects common to some classes of molecules, and other aspects of safety and efficacy.

Over the course of the working group's activity a number of alternatives were proposed and developed. In general terms these were:

• the adaptation of existing scales or measurement techniques

• the development of new scales

• and the inclusion of digital measurements to supplement existing scales.

Measuring symptom severity and efficacy of RAADs with existing rating instruments

MADRS/SIGMA

Singh et al. (29) used both full and abbreviated MADRS scores in the assessment of response to IV esketamine. They assessed depressive symptoms at 2, 4, and 24 h following the infusion and used the term “since the last assessment” for the 2 and 4-h periods where the sleep and appetite items were omitted. In Johnson et al. (30) they evaluated the suitability of the MADRS within a 24-h recall period as opposed to the standard reference period indicated above as the past week and found that this had equal content validity with high internal consistency reliability (Cronbach α of 0.84 and 0.91) and test-retest reliability (intraclass correlation coefficients of 0.96 and 0.91). According to their study “the majority of participants reported that a meaningful change in depression symptoms could be assessed in a 24-h recall period, except for reduced sleep and appetite.” This echoes Singh et al. (29) and their decision to omit these items for the shorter recall period. Using esketamine trial data, Yavorsky et al. (31), noted that the MADRS was sensitive to change across short periods though the sleep and suicide item did not appear to show significant sensitivity to change reinforcing the idea that rapid change is most often associated with mood symptoms. The conventional triad of depressive symptoms (mood, appetite, and sleep disturbances) does not apply neatly into a 24-h time frame for assessment of clinical change.

HDRS/SIGH-D

Bunney and Bunney (32), while investigating the rapid-acting antidepressant effect of ketamine paired with sleep modification, noted significant decreases in HAMD scores within a 48-h period using an unmodified version of the scale (and indicated that many patients met criteria for remission in this period). Their paper also provides a meaningful review of the literature around time-to-response for a range of standard antidepressant compounds. Luckenbaugh et al. (33) compared the use of the 17 item HDRS, and 8 other shortened versions of the HDRS, including one or two item measures within ketamine clinical trials. Overall, the one or two item scales demonstrated the smallest effect sizes to ketamine, while the 17-item HDRS was associated with the lowest response rates. They also noted that the “response rates for HDRS total score were the smallest of all the scales, including those for single item measures. This is most likely due to the fact that a few items are included in the total score that cannot change measurably over the brief time frame.” This again suggests the sensitivity of more durable symptoms may be limited while the alleviation of primary mood symptoms appears robust and measureable with these types of rapid-acting compounds. Milak et al. (34) employed a similar rationale in their use of the HDRS-24 that excluded those items for diurnal variation and loss of weight in their study examining the impact of ketamine on glutamate, glutamine (Glx) and γ-aminobutyric acid (GABA) level and symptoms of depression.

HAMD6

The six-item version of the HDRS was developed by Per Beck and is a valid and sensitive scale for assessing depressive symptoms in clinical trials (35, 36). The HAMD6 can be used for RAAD studies because it is designed to query symptoms experienced in the past 72 h and can be modified for shorter durations as well. The 6 items include depressed mood, loss of self-esteem, loss in interest, psychomotor retardation, psychic anxiety, and somatic symptoms. The HAMD6 does not inquire about sleep or appetite. In two clinical trials of RAADs, HAMD6 scores improved significantly more than placebo assigned subjects (37, 38). In one exploratory study of a potential RAAD (an mTORC1 activator), the candidate drug was significantly better than placebo on the HAMD6 at 4 and 12 h after dosing (37).

IDS/QIDS-SR16

The abbreviated form of the Quick Inventory of Depressive Symptoms (QIDS-SR16) by Rush et al. is a 16-item patient-rated instrument that has been shown to be sensitive change in patients undergoing RAAD treatment (39, 40). In both studies, participating patients completed the QIDS-SR16 ~2 days after each IV ketamine infusion.

Lenderking et al. (41) had previously developed the QIDS-SRD14 as a daily measure of depressive symptoms for the purpose of detecting rapid treatment effects. The scale takes the 16 items of the QIDS and eliminates the items pertaining to sleep and weight, rewording the other items to be relevant in a 24-h response window. The scale was used in a depression methods study and showed significantly faster detection of treatment onset and response than the HAM-D or MADRS administered weekly (42).

PHQ-9

The PHQ-9 is useful for screening individuals for depression (43). The PHQ-9 guidelines indicate that the symptoms should be measured over the “past 2 weeks” and in many studies [e.g., (44)], the instrument was used in this manner. Used in this manner, the PHQ-9 has shown sensitivity to change in response to RAAD treatment in an esketamine study [e.g., (45)]. The changes noted across the PHQ-9, MADRS and CGI-S in this study showed consistency in change scores from baseline to Day 15 and 28. While, Artin et al. (46) also reported similar findings with the PHQ-9 in terms of detection of change over time, we did not locate any studies that used a modified recall period for this instrument.

BDI

The Beck Depression Inventory (BDI)I has been shown to be sensitive to change in general in the treatment of depressed individuals. Wilkinson et al. (47), noted that the BDI was uniquely insensitive to suicidality (when compared to the HAMD, MADRS and QIDS-SR). Nonetheless, other studies [e.g., (48)] used it as a treatment outcome measure to assess changes over a 24-h period in response to ketamine treatment for MDD. Weigand et al. showed a significant change in BDI score from baseline was 34.1 (SD = 11.3) to 24.7 (SD = 10.0) 24 h after ketamine administration [t_{(1, 23)} = 4.1, P < 0.001].

Consideration of the adaptation of existing scales for RAAD trials

While several of the studies cited above employed abbreviated or otherwise modified versions of existing scales, there were none that specifically validated the versions that were used in their studies. The RAAD working group agreed that, ideally, for an appropriate psychometric validation of the modified HAMD and MADRS versions that have excluded sleep and appetite, an item-response theory (IRT) analysis should be conducted to determine the item-level contribution when used with these populations. Insofar as techniques that could be used for the development of additional scale in the investigation of rapid-response, the “discan” (discretized analog) method was suggested.¹ This method allows for a more clinically oriented categorization and, according to Singh and Bilsbury (49), it can be used for “obtaining reliable and precise measures of subjectively experienced dysfunction variables whose possible values form a continuum.”

Discretization is a term borrowed from applied mathematics and, in this context, translates to breaking down an essential concept (e.g., a continuous variable) into discrete (measurable) items that are familiar to psychometricians. While this method has been in practice since the mid-1980s and provides a more organized and replicable way to generate new scales, it has not yet been applied to the problem of measuring rapid acting anti-depressants and needs further research and attention.

Alternatively, the working group recognized that the HAMD-6 is already a well-validated instrument but noted that has had limited use in RAAD trials to date.

The development of new scales

The development of novel scales for the assessment of RAADs and other antidepressants has been underway. Sonya Eremenco (Executive Director of the Patient-Reported Outcome Consortium at The Critical Path Institute) presented her research to the ISCTM RAAD working group on a newly validated scale called the Symptoms of Major Depressive Disorder Scale (SMDDS) (see Appendix 1 for conceptual framework) that was developed in conjunction with the FDA and Critical Path Institute (C-PATH). This scale follows the work of McCarrier et al. (50) and Bushnell et al. (51) in which they outlined their considerations of FDA guidance around best practices as well as the development of a patient-reported outcome with high content validity. According to FDA documents (52) [Clinical Outcome Assessments (COA) Qualification Program DDT COA #000109]: “The intent is to use the SMDDS to assess treatment benefit in clinical trials for MDD therapies that may have a faster onset of action and potentially communicate this earlier antidepressant treatment benefit in the product label.” At this writing, the scale has not been used in a clinical trial with rapid-acting antidepressants but given its sensitivity to change over shorter time durations and specific construction for this purpose, it may play a role in studies using rapid-acting antidepressants moving forward.

McIntyre et al. (39, 53) developed the McIntyre And Rosenblat Rapid Response Scale (MARRRS) with the specific aim of measuring rapid treatment response. The MARRRS is a self-report measure that contains 14 items scored 0–4 based upon symptoms experienced in the past 72 h. The psychometric characteristics of the MARRRS appear very promising. In a validation study, the MARRRS had a high internal consistency across acute infusions as determined by Cronbach's alpha (0.84–0.94). There was significant convergent validity between the QIDS-SR-16 and MARRRS total scores across infusions [rs₍₂₉₂₎ = 0.87, p < 0.001]; the MARRRS was also sensitive to change [rs₍₄₉₎ = 0.70, p < 0.001]. It is noteworthy that an exploratory factor analysis showed that MARRRS items loaded onto two factors (i.e., dysphoria and psychic anxiety) accounting for 63.4% of the total variance.

This scale will be used in an upcoming Phase II trial (Psilocybin for Treatment-Resistant Depression: NCT05029466). Gukasyan et al. (54) indicate that psilocybin “produces substantial and rapid antidepressant effects in patients with major depressive disorder (MDD).”

Based on the ISCTM working group's consensus, the areas of depressive illness that appear to be most apt to change in short periods of time (e.g., depressed mood, anxiety, suicidal ideation, and anergia) mirror those items validated in the MARRRS.

The inclusion of digital methods to supplement existing scales

The working group expressed a great deal of interest in the exploration of digital measurements to assess RAADs, but the consensus agreed no single system was ready to be used as a primary tool in a clinical trial. Digital methods have the advantage of periodic clinician assessments by collecting moment to moment reports that might be more accurate because they are essentially “real time” measures obtained directly from the source (the participant). There have been significant advances in the use of digital methods as it applies to the measurement of depressive symptoms [e.g., (55–57)]. Onnela et al. demonstrated the utility of smartphone-based applications that allow for more ecologically valid assessment of depressive symptoms over time and in an Aledavood et al. (58) review, the prevalence and validity of a range of techniques for measuring symptoms using actigraphy and other features that are built into many smartphone platforms. To paraphrase Kamath et al. (59), non-standardized assessments that use ecological momentary assessments (EMA) and other digital phenotyping techniques usually lack validation but may provide important clinical information that is not generally accessible in the context of a study or clinic visit. Validation of EMA applications has begun in clinical trials for depression. In a open label antidepressant trial, Targum et al. (60) found consistent correlations between clinician reported outcome measures (HAMD-17 and HAMD-6) and an EMA derived HAMD-6 completed by the participating subject using a smartphone. They noted that the daily EMA reports anticipated clinical outcome before the clinician visits and demonstrated consistent symptomatic improvement or little clinical change from day to day. The use of EMA pre-randomization to establish critical baseline stability was suggested as essential in the study of RAADs.² This baseline stability is critical because EMA provides a “snapshot” of behavior that may not be the most accurate assessment of a patient's overall symptom severity, thus a demonstration of stability by multiple EMA entries will be important in the ability of researchers to assess the impact of a rapid-acting compound over time.

With advances in machine learning (ML) to analyze facial and vocal data [e.g., (61)] and the near ubiquity of smartphones to provide passive data there will no doubt be integration of these methods into upcoming trials in this space.

Conclusion

While this review shows that with the described adaptations, existing standard rating instruments can capture some symptomatic change in the timeframe in which RAADs act, they remain inherently limited by their structure (items like sleep and appetite), timeframe of retrospective review of symptoms (usually 7 days) and conceptual biases. It is also noteworthy that, to date, FDA has accepted only clinician rated outcome measures (HAMD, MADRS, and CDRS-R) in Phase III trials as primary endpoints to support an indication for mood disorders. All the traditional rating instruments described in this review have strong literature support for their validity, reliability and sensitivity to change, but these instruments have principally been conceptualized and tested under specific, older treatment paradigms (e.g., MAOIs, TCAs, and SSRIs). Despite past successes, the ISCTM working group concluded that it is likely that these measures fail to capture some aspects of the patient experience following treatment with RAADs.

Both the work of the Critical Path Institute and McIntyre et al. have shown that scales specific to the detection of change in depressive symptoms can be developed and have, in a preliminary way, been deployed into clinical trials. Given the psychometric properties and intent of their design for RAADs, both the SMDDS and MARRRS have significant promise.

Digital technologies including ecological momentary assessments (EMA), sleep and activity trackers as well as a range of applications that use facial and vocal metrics have been developed and should, initially, work to supplement and cross-check the items in existing or new scales that overlap. For EMA applications this may serve to provide more real-time assessments of the patient's mood states over the past week, while sleep and activity applications would seem to bolster or validate patient reports in these domains of depressive symptomology. Voice and facial metrics may be the closest to being sensitive to rapid changes in mood that can be reflected in affect or voice parameters.

Further development of new measures that meaningfully evaluate the novel effects of RAADs will require new approaches, as radical as the departure was from the older generations of drugs to the newer ones. The rapidity of effect that these new treatments exhibit may be the most obvious hallmarks of RAADs, but the speed with which they are reported to work might not be the feature which best distinguishes them. To ensure we are fully and objectively measuring what matters most to patients, clinicians and other stakeholders, our rating instruments and perhaps even our most basic assumptions about what they are and how they work may also have to adapt to keep up with the remarkable pace of change in the field of RAADs.

Author contributions

CY, EB, MO, JS, ST, and WL participated in the Rapid-Acting Anti-Depressants working group and in the research, compilation, and writing and editing of this manuscript. Additional credit is given to the ISCTM RAAD Working Group at large who were able to provide a broad range of perspectives from regulatory to academic to industry. All authors contributed to the article and approved the submitted version.

Conflict of interest

CY is the Chief Scientific Officer at Valis Biosciences, Inc. EB is the Director of Psychology and Behavior Research and the Director of Predoctoral Training at the Experimental Therapeutics and Pathophysiology Branch in the National Institute of Mental Health. MO is Chief Research Officer at WCG Clinical Endpoint Solutions. JS is Vice President of Clinical Science at WCG Clinical Endpoint Solutions. ST is Scientific director at Signant Health, Chief Medical Officer at Functional Neuromodulation LLC., and has had consulting relationships with Actinogen Medical Ltd., AZ Therapies, BioXcel Therapeutics, EMA Wellness, Denovo BioPharma, Frequency Therapeutics, Karuna Therapeutics Inc., Merck Pharmaceuticals, Navitor Pharmaceuticals, Neurocrine Bioscience, and Relmada Therapeutics within the past 3 years. WL is the Vice President and Executive Director of Patient Centered Research at Evidera, a PPD company.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpsyt.2023.1135828/full#supplementary-material

Footnotes

1. ^Suggested by working group member David Sheehan, MD.

2. ^Suggested by working group member Steve Targum, MD.

References

1. Cristea IA, Naudet FUS. Food and drug administration approval of esketamine and brexanolone. Lancet Psychiatry. (2019) 12:975–7. doi: 10.1016/S2215-0366(19)30292-5

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Rush A, Trivedi M, Wisniewski S, Nierenberg A, Stewart J, Warden D, et al. Acute and longer-term outcomes in depressed outpatients requiring one or several treatment steps: a STAR^*D report. Am J Psychiatry. (2006) 163:1905–17. doi: 10.1176/ajp.2006.163.11.1905

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Kautzky A, Baldinger-Melich P, Kranz GS, Vanicek T, Souery D, Montgomery S, et al. A new prediction model for evaluating treatment-resistant depression. J Clin Psychiatry. (2017) 78:215–22. doi: 10.4088/JCP.15m10381

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Opler L, Opler MG, Arnsten A. Ameliorating treatment-refractory depression with intranasal ketamine: potential NMDA receptor actions in the pain circuitry representing mental anguish. CNS Spectr. (2016) 21:12–22. doi: 10.1017/S1092852914000686

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Bremner JD, Krystal JH, Putnam FW, Southwick SM, Marmar C, Charney DS, et al. Measurement of dissociative states with the clinician-administered dissociative states scale (CADSS). J Trauma Stress. (1998) 11:125–36. doi: 10.1023/A:1024465317902

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Dittrich A, Lamparter D, Maurer M. 5D.-ASC: Questionnaire for the Assessment of Altered States of Consciousness. A Short Introduction 3rd Edn. Zürich: PSIN PLUS (2010).

Google Scholar

7. Dittrich A. The standardized psychometric assessment of altered states of consciousness (ASCs) in humans. Pharmacopsychiatry. (1998) 31:80–4. doi: 10.1055/s-2007-979351

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Pahnke WN. Drugs and Mysticism: An Analysis of the Relationship Between Psychedelic Drugs and the Mystical Consciousness. Cambridge, MA: Harvard University Press (1963).

Google Scholar

9. Pahnke WN. Psychedelic drugs and mystical experience. Int Psychiatry Clin. (1969) 5:149–62.

Google Scholar

10. Muscat S, Hartelius G, Crouch C, Morin K. An integrative approach to ketamine therapy may enhance multiple dimensions of efficacy: improving therapeutic outcomes with treatment resistant depression. Front Psychiatry. (2021) 12:710338. doi: 10.3389/fpsyt.2021.710338

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Davis A, Barrett F, Griffiths R. Psychological flexibility mediates the relations between acute psychedelic effects and subjective decreases in depression and anxiety. J Contextual Behav Sci. (2020) 15:39–45. doi: 10.1016/j.jcbs.2019.11.004

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Carhart-Harris RL, Bolstridge M, Day CMJ, Rucker J, Watts RR. Psilocybin with psychological support for treatment-resistant depression: Six-month follow-up. Psychopharmacol. (2018) 235:399–408. doi: 10.1007/s00213-017-4771-x

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Roseman L, Nutt D, Carhart-Harris R. Quality of acute psychedelic experience predicts therapeutic efficacy of psilocybin for treatment-resistant depression. Front Pharmacol. (2018) 8:974. doi: 10.3389/fphar.2017.00974

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Griffiths R, Johnson M, Richards W, Richards B, McCann U, Jesse R, et al. Psilocybin occasioned mystical-type experiences: immediate and persisting dose-related effects. Psychopharmacology. (2011) 218:649–65. doi: 10.1007/s00213-011-2358-5

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Hamilton M. A rating scale for depression. J Neurol Neurosurg Psychiatry. (1960) 23:56–62. doi: 10.1136/jnnp.23.1.56

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Montgomery SA, Åsberg M. A new depression scale designed to be sensitive to change. Br J Psychiatry. (1979) 134:382–9. doi: 10.1192/bjp.134.4.382

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Williams JB. A structured interview guide for the Hamilton Depression Rating Scale. Arch Gen Psychiatry. (1988) 45:742–7. doi: 10.1001/archpsyc.1988.01800320058007

PubMed Abstract | CrossRef Full Text | Google Scholar

18. Williams JB, Kobak KA. Development and reliability of a structured interview guide for the Montgomery Åsberg Depression Rating Scale (SIGMA). Br J Psychiatry. (2008) 192:52–8. doi: 10.1192/bjp.bp.106.032532

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Sheehan DV. The Sheehan Disability Scales. In: Sheehan D, editors. The Anxiety Disease and How to Overcome It. New York, NY: Charles Scribner and Sons (1983). p. 151.

Google Scholar

20. Sheehan KH, Sheehan DV. Assessing treatment effects in clinical trials with the discan metric of the Sheehan Disability Scale. Int Clin Psychopharmacol. (2008) 23:70–83. doi: 10.1097/YIC.0b013e3282f2b4d6

PubMed Abstract | CrossRef Full Text | Google Scholar

21. Posner K, Oquendo MA, Gould M, Stanley B, Davies M. Columbia classification algorithm of suicide assessment (C-CASA): classification of suicidal events in the FDA's pediatric suicidal risk analysis of antidepressants. Am J Psychiatry. (2007) 164:1035–43. doi: 10.1176/ajp.2007.164.7.1035

PubMed Abstract | CrossRef Full Text | Google Scholar

22. Sheehan D, Alphs L, Mao L, Li Q, May R, Bruer E, et al. Comparative validation of the S-STS, the ISST-Plus, and the C–SSRS for assessing the suicidal thinking and behavior FDA 2012 suicidality categories. Innov Clin Neurosci. (2014)11:32.

PubMed Abstract | Google Scholar

23. Bagby R, Ryder A, Schuller D, Marshall M. The Hamilton Depression Rating Scale: has the gold standard become a lead weight? Am J Psychiatry. (2004) 161:2163–77. doi: 10.1176/appi.ajp.161.12.2163

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Giddens J, Sheehan K, Sheehan D. The Columbia-Suicide Severity Rating Scale (C–SSRS): Has the “gold standard” become a liability? Innov Clin Neurosci. (2014) 11:66–80.

PubMed Abstract | Google Scholar

25. Gould T, Zarate C, Thompson S. Molecular pharmacology and neurobiology of rapid-acting antidepressants. Annu Rev Pharmacol Toxicol. (2019) 59:213–36. doi: 10.1146/annurev-pharmtox-010617-052811

PubMed Abstract | CrossRef Full Text | Google Scholar

26. Park L, Luckenbaugh D, Pennybaker S, Hopkins M, Henter I, Lener M, et al. The effects of ketamine on typical and atypical depressive symptoms. Acta Scandan. (2020) 142:394–401. doi: 10.1111/acps.13216

PubMed Abstract | CrossRef Full Text | Google Scholar

27. Coyle CM, Laws KR. The use of ketamine as an antidepressant: a systematic review and meta-analysis. Hum Psychopharmacol. (2015) 30:152–63. doi: 10.1002/hup.2475

PubMed Abstract | CrossRef Full Text | Google Scholar

28. Tuck A, Ghazali D. Ketamine as a rapid-acting antidepressant: promising clinical and basic research. Am J Psychiatry Resid J. (2017) 12:3–5. doi: 10.1176/appi.ajp-rj.2017.120302

CrossRef Full Text | Google Scholar

29. Singh J, Fedgchin M, Daly E, Xi L, Melman C, De Bruecker G, et al. Intravenous esketamine in adult treatment-resistant depression: a double-blind, double-randomization, placebo-controlled study. Biol Psychiatry. (2016) 80:424–31. doi: 10.1016/j.biopsych.2015.10.018

PubMed Abstract | CrossRef Full Text | Google Scholar

30. Johnson K, Devine J, Ho K, Howard K, Saretsky T, Jamieson C, et al. Evidence to support Montgomery-Åsberg depression rating scale administration every 24 hours to assess rapid onset of treatment response. J Clin Psychiatry. (2016) 77:1681–6. doi: 10.4088/JCP.15m10253

PubMed Abstract | CrossRef Full Text | Google Scholar

31. Yavorsky C, Singh J, Engelhardt N. Can the MADRS measure rapid changes in depressive symptoms in response to esketamine treatment? Neuropsychopharmacol. (2017) 42:S476–652. doi: 10.1038/npp.2017.266

CrossRef Full Text | Google Scholar

32. Bunney B, Bunney W. Rapid-acting antidepressant strategies: mechanisms of action. Int J Neuropsychopharmacol. (2012) 15:695–713. doi: 10.1017/S1461145711000927

PubMed Abstract | CrossRef Full Text | Google Scholar

33. Luckenbaugh DA, Ameli R, Brutsche N, Zarate CA. Rating depression over brief time intervals with the Hamilton Depression Rating Scale: standard vs. abbreviated scales. J Psychiatr Res. (2015) 61:40–5. doi: 10.1016/j.jpsychires.2014.12.015

PubMed Abstract | CrossRef Full Text | Google Scholar

34. Milak M, Rashid R, Dong Z, Kegeles L, Grunebaum M, Ogden T, et al. Assessment of relationship of ketamine dose with magnetic resonance spectroscopy of Glx and GABA responses in adults with major depression: A randomized clinical trial. JAMA Netw Open. (2020) 3:e2013211. doi: 10.1001/jamanetworkopen.2020.13211

PubMed Abstract | CrossRef Full Text | Google Scholar

35. Timmerby N, Andersen JH, Søndergaard S, Østergaard SD, Bech PA. Systematic review of the clinimetric properties of the 6-item version of the hamilton depression rating scale (HAM-D6). Psychother Psychosom. (2017) 86:141–9. doi: 10.1159/000457131

PubMed Abstract | CrossRef Full Text | Google Scholar

36. O'Sullivan RL, Fava M, Agustin C, Baer L, Rosenbaum JF. Sensitivity of the six-item Hamilton Depression Rating Scale. Acta Psychiatr Scand. (1997) 95:379–84. doi: 10.1111/j.1600-0447.1997.tb09649.x

PubMed Abstract | CrossRef Full Text | Google Scholar

37. Targum S, Leventer S, Hughes T, Owen R, Vlasuk G. NV-5138 a novel, direct activator of the mechanistic target of rapamycin complex 1 (mTORC1): a phase 1b randomized, double-blind, placebo-controlled single oral dose study in subjects with treatment-resistant depression (TRD). Am Coll Neuropsychopharmac.(2019) 44:126–7. doi: 10.1038/s41386-019-0545-y

PubMed Abstract | CrossRef Full Text | Google Scholar

38. Fava M, Freeman MP, Flynn M, Judge H, Hoeppner BB, Cusin C, et al. Double-blind, placebo-controlled, dose-ranging trial of intravenous ketamine as adjunctive therapy in treatment-resistant depression (TRD). Mol Psychiatry. (2020) 25:1592–603. doi: 10.1038/s41380-018-0256-5

PubMed Abstract | CrossRef Full Text | Google Scholar

39. McIntyre R, Lipsitz O, Lui LMW, Rodrigues N, Gill H, Nasri F, et al. The meaningful change threshold as measured by the 16-item quick inventory of depressive symptomatology in adults with treatment-resistant major depressive and bipolar disorder receiving intravenous ketamine. J Affect Disord. (2021) 294:592–6. doi: 10.1016/j.jad.2021.07.035

PubMed Abstract | CrossRef Full Text | Google Scholar

40. Lipsitz O, McIntyre RS, Rodrigues NB, Kaster TS, Cha DS, Brietzke E, et al. Early symptomatic improvements as a predictor of response to repeated-dose intravenous ketamine: Results from the Canadian Rapid Treatment Center of Excellence. Prog Neuropsychopharmacol Biol Psychiatry. (2021) 105:110126. doi: 10.1016/j.pnpbp.2020.110126

PubMed Abstract | CrossRef Full Text | Google Scholar

41. Lenderking WR, Hu M, Cappelleri JC, Rush AJ. “Reliability, validity, and responsiveness of the QIDS-SR16 daily version,” In: 12th Annual International Society Of Quality-Of-Life Research Annual Meeting. San Francisco, CA.

Google Scholar

42. Lenderking WR, Hu M, Tennen H, Cappelleri JC, Petrie CD, Rush AJ, et al. Daily process methodology for measuring earlier antidepressant response. Contemp Clin Trials. (2008) 29:867–77. doi: 10.1016/j.cct.2008.05.012

PubMed Abstract | CrossRef Full Text | Google Scholar

43. Kroenke K, Spitzer R, Williams JB. The PHQ-9: validity of a brief depression severity measure. J Gen Intern Med. (2001) 16:606–13. doi: 10.1046/j.1525-1497.2001.016009606.x

PubMed Abstract | CrossRef Full Text | Google Scholar

44. Hudgens S, Floden L, Blackowicz M, Jamieson C, Popova V, Fedgchin M, et al. Meaningful change in depression symptoms assessed with the patient health questionnaire (PHQ-9) and Montgomery-Åsberg depression rating scale (MADRS) among patients with treatment resistant depression in two, randomized, double-blind, active-controlled trials of esketamine nasal spray combined with a new oral antidepressant. J Affect Disord. (2021) 281:767–75. doi: 10.1016/j.jad.2020.11.066

PubMed Abstract | CrossRef Full Text | Google Scholar

45. Singh J, Fedgchin M, Daly E, Drevets W. Chapter Eight: relapse prevention in treatment-resistant major depressive disorder with rapid-acting antidepressants. In:Duman RS, Krystal JH, , editors. Advances in Pharmacology. Vol. 89. New Haven, CT: Academic Press (2020). p. 237–259.

Google Scholar

46. Artin H, Bentley S, Mehaffey E, Liu FX, Sojourner K, Bismark AW, et al. Effects of intranasal (S)-ketamine on Veterans with co-morbid treatment-resistant depression and PTSD: a retrospective case series. EClinicalMedicine. (2022) 48:101439. doi: 10.1016/j.eclinm.2022.101439

PubMed Abstract | CrossRef Full Text | Google Scholar

47. Wilkinson S, Ballard E, Bloch M, Mathew S, Murrough J, Feder A, et al. The effect of a single dose of intravenous ketamine on suicidal ideation: a systematic review and individual participant data meta-analysis. Am J Psychiatry. (2018) 175:150–8. doi: 10.1176/appi.ajp.2017.17040472

PubMed Abstract | CrossRef Full Text | Google Scholar

48. Weigand A, Gärtner M, Scheidegger M, Wyss PO, Henning A, Seifritz E, et al. Predicting antidepressant effects of ketamine: The role of the pregenual anterior cingulate cortex as a multimodal neuroimaging biomarker. Int J Neuropsychopharmacol. (2022) 25:1003–13. doi: 10.1093/ijnp/pyac049

PubMed Abstract | CrossRef Full Text | Google Scholar

49. Singh, A, Bilsbury, C. Psychology Estimating levels of subjectively experienced states on DISCAN Scales by RCS: A repeated comparison system. Pain. (1985) 23:314–5. doi: 10.1016/0304-3959(85)90137-X

CrossRef Full Text | Google Scholar

50. McCarrier K, Deal L, Abraham L, Blum S, Bush E, Martin M, et al. Patient-centered research to support the development of the symptoms of major depressive disorder scale (SMDDS): initial qualitative research. Patient. (2016) 9:117–34. doi: 10.1007/s40271-015-0132-1

PubMed Abstract | CrossRef Full Text | Google Scholar

51. Bushnell D, McCarrier K, Bush E, Abraham L, Jamieson C, McDougall F, et al. Symptoms of major depressive disorder scale: performance of a novel patient-reported symptom measure. Value Health. (2019) 22:906–15. doi: 10.1016/j.jval.2019.02.010

PubMed Abstract | CrossRef Full Text | Google Scholar

52. FDA. FDA Clinical Outcome Assessments (COA) Qualification Program Submissions Page. Food and Drug Administration. Clinical Outcome Assessments (COA) Qualification Program DDT COA #000109 (2021).

Google Scholar

53. McIntyre R, Rodrigues N, Lipsitz O, Lee Y, Cha DS, Gill H, et al. Validation of the McIntyre and rosenblat rapid response scale (MARRRS) in adults with treatment-resistant depression receiving intravenous ketamine treatment. J Affect Disord. (2021) 288:210–6. doi: 10.1016/j.jad.2021.03.053

PubMed Abstract | CrossRef Full Text | Google Scholar

54. Gukasyan N, Davis AK, Barrett FS, et al. Efficacy and safety of psilocybin assisted treatment for major depressive disorder: Prospective 12-month follow-up. J Psychopharmacol. (2022) 36:151–8. doi: 10.1177/02698811211073759

PubMed Abstract | CrossRef Full Text | Google Scholar

55. Pellegrini AM, Huang EJ, Staples PC, Hart KL, Lorme JM, Brown HE, et al. Estimating longitudinal depressive symptoms from smartphone data in a transdiagnostic cohort. Brain Behav. (2022) 12:e02077. doi: 10.1002/brb3.2077

PubMed Abstract | CrossRef Full Text | Google Scholar

56. Chandler C, Foltz P, Cohen A, Holmlund T, Cheng J, Bernstein J, et al. Machine learning for ambulatory applications of neuropsychological testing. Intell Based Med. (2020) 1–2:100006. doi: 10.1016/j.ibmed.2020.100006

CrossRef Full Text | Google Scholar

57. Anzar A, Sauder C, Yadav V, Koesmahargyo V, Aghjayan A, Marecki S, et al. Remote digital measurement of facial and vocal markers of major depressive disorder severity and treatment response: A pilot study. Front Digit Health. (2021) 3:610006. doi: 10.3389/fdgth.2021.610006

PubMed Abstract | CrossRef Full Text | Google Scholar

58. Aledavood T, Torous J, Triana-Hoyos A, Naslund J, Onnela JP, Keshavan M, et al. Smartphone-based tracking of sleep in depression, anxiety, and psychotic disorders. Curr Psychiatry. (2019) 21:49. doi: 10.1007/s11920-019-1043-y

PubMed Abstract | CrossRef Full Text | Google Scholar

59. Kamath J, Leon Barriera R, Jain N, Keisari E, Wang B. Digital phenotyping in depression diagnostics: integrating psychiatric and engineering perspectives. World J Psychiatry. (2022) 12:393–409. doi: 10.5498/wjp.v12.i3.393

PubMed Abstract | CrossRef Full Text | Google Scholar

60. Targum S, Sauder C, Evans M, Saber J, Harvey PD. Ecological momentary assessment as a measurement tool in depression trials. J Psychiatr Res. (2021) 136:256–64. doi: 10.1016/j.jpsychires.2021.02.012

PubMed Abstract | CrossRef Full Text | Google Scholar

61. Neumann M, Roessler O, Suendermann-Oeft D, Ramanarayanan V. On the utility of audiovisual dialog technologies and signal analytics for real-time remote monitoring of depression biomarkers. In: Proceedings of the First Workshop on Natural Language Processing for Medical Conversations. Stroudsburg, PA: Association for Computational Linguistics (2020). p. 47–52.

Google Scholar

Keywords: rapid acting anti-depressants, psychometrics, measurement of rapid response to anti-depressant treatment, ISCTM working group, novel anti-depressants, ketamine, psychedelics

Citation: Yavorsky C, Ballard E, Opler M, Sedway J, Targum SD and Lenderking W (2023) Recommendations for selection and adaptation of rating scales for clinical studies of rapid-acting antidepressants. Front. Psychiatry 14:1135828. doi: 10.3389/fpsyt.2023.1135828

Received: 01 January 2023; Accepted: 02 May 2023;
Published: 02 June 2023.

Edited by:

Acioly L. T. Lacerda, Federal University of São Paulo, Brazil

Reviewed by:

Scott Tyler Aaronson, Sheppard and Enoch Pratt Hospital, United States
Li Yin, Sichuan University, China

Copyright © 2023 Yavorsky, Ballard, Opler, Sedway, Targum and Lenderking. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Christian Yavorsky, Y2hyaXN0aWFuQHZhbGlzYmlvc2NpZW5jZS5jb20=

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.