Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Sleep, 11 September 2025

Sec. Sleep and Circadian Rhythms

Volume 4 - 2025 | https://doi.org/10.3389/frsle.2025.1648256

This article is part of the Research TopicThe Promise of Sleep TechnologyView all 9 articles

Performance evaluation of a novel screening tool for depressive episodes in COMISA: a comparative analysis with standard neuropsychometric assessments

  • 1Clinical Research and Development Department, Medibio Limited, Savage, MN, United States
  • 2Department of Biomedical Sciences, Humanitas University, Milan, Italy
  • 3Department of Clinical Neurosciences, Villa San Benedetto Menni Hospital, Hermanas Hospitalarias, Albese con Cassano, Italy
  • 4Personalized Medicine Center for Anxiety and Panic Disorders, Humanitas San Pio X Hospital, Milan, Italy
  • 5VA Greater Los Angeles Healthcare System, University of California, Los Angeles, Los Angeles, CA, United States

Introduction: Insomnia and obstructive sleep apnea (OSA) are each linked to elevated risks of depression. When comorbid (COMISA), these risks increase further, highlighting the need for effective depression screening. This study evaluated the screening accuracy of a novel software, MEB-001, for detecting current major depressive episode (cMDE) in individuals identified as with and without suspected COMISA (sCOMISA).

Methods: We conducted a retrospective sub-analysis from a prospective multicenter study at U.S. sleep clinics, including 296 adults who underwent routine polysomnography (PSG). Electrocardiogram and electroencephalogram signals, along with items 1 and 2 of the self-report depression screener Patient Health Questionnaire, 9 items (PHQ-9), were used to generate MEB-001 screening output. The Mini International Neuropsychiatric Interview (MINI) served as the diagnostic reference for cMDE. Sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) were calculated for MEB-001 and PHQ-9 (cut-off ≥10), with subgroup comparisons conducted using Fisher's exact and McNemar's tests (p < 0.05).

Results: MINI identified cMDE in 15.5% of participants (16.9% in sCOMISA; 14.2% in non-COMISA). Against the MINI, MEB-001 achieved 84.8% sensitivity, 72.0% specificity, 35.8% PPV, and 96.3% NPV in the full cohort; PHQ-9 ≥10 showed similar performance (89.1, 68.4, 34.2, and 97.2%, respectively). MEB-01's performance did not differ between sCOMISA and non-sCOMISA (all p ≥ 0.68), and no significant McNemar differences were found in the subgroups (p > 0.19).

Discussion: MEB-001 demonstrated consistent cMDE screening performance comparable to the PHQ-9, supporting its potential utility in sleep clinic settings.

1 Introduction

Co-morbid insomnia and obstructive sleep apnea (COMISA) is a common and debilitating condition that leads to compounded difficulties in patients' sleep, daytime performance, and overall quality of life compared to those with either insomnia or obstructive sleep apnea (OSA) alone, making diagnosis and treatment decisions more complex for healthcare providers (Sweetman et al., 2019, 2017). Epidemiological findings reported that 39–58% of OSA patients report insomnia symptoms, and 29−67% of insomnia patients fulfill minimal criteria for OSA (Luyster et al., 2010). Between 2013 and 2018, several large cluster analyses of OSA samples revealed that 32% to 54% of patients experienced 'disturbed sleep,' primarily characterized by nocturnal insomnia (Ye et al., 2013; Gagnadoux et al., 2016). A recent systematic review and meta-analysis by Zhang and colleagues, which included 37 studies on insomnia and OSA co-morbidity, found that 35% of insomnia patients have an AHI ≥5 indicating the presence of mild OSA, 29% have an AHI ≥15 indicating the presence of a moderate condition, and 38% of OSA patients meet insomnia criteria (Zhang et al., 2019).

Both insomnia and OSA have been associated with psychiatric symptoms, including depression and anxiety (Luo et al., 2025). Studies reported that OSA patients report notably higher depressive symptoms compared to non-OSA cohorts, with prevalence estimates of between 17 and 48% (Harris et al., 2009). This is markedly elevated when compared to the global lifetime prevalence of depression, which ranges from 3 to 17% (Andrade et al., 2003).

Similarly, chronic insomnia frequently coexists with various psychiatric and physical disorders, notably depression. It often precedes and serves as a risk factor for the onset of depression and is recognized as both a risk factor and a symptom of the disorder. Consistent with this, studies have shown that treating insomnia, whether through pharmacotherapy or psychological interventions such as Cognitive Behavioral Therapy for Insomnia (CBT-I) (Koffel et al., 2018; Cheung et al., 2014; Qaseem et al., 2016; Araujo et al., 2017), can lead to significant reductions in depressive symptoms.

The burden of depressive symptoms associated with insomnia and OSA increases when these two conditions coexist. Smith et al. (2004) found that sleep clinic patients with comorbid insomnia reported moderate to severe depression, anxiety, and stress, unlike those with OSA alone, who showed non-clinical levels (Smith et al., 2004). Similarly, Lang et al. reported higher rates of depressive symptoms in individuals with COMISA (42.6%) compared to those with insomnia (21.6%) or OSA alone (8.4%), despite similar severity of sleep-related symptoms across groups (Lang et al., 2017).

More recently, data from the National Health and Nutrition Examination Survey indicated that individuals with probable COMISA were about three times more likely to report suicidal ideation and/or self-harm, with severe cases strongly associated with COMISA (Kalra et al., 2024).

A prospective study of 309 U.S. military personnel found that 32.7% had insomnia, 30.4% had OSA, and 36.9% had COMISA. Compared to those with OSA alone, individuals with insomnia or COMISA showed greater insomnia severity, poorer sleep quality, more disruptive nocturnal behaviors, and higher rates of PTSD, fatigue, anxiety, depression, and nightmare disorder (Mysliwiec et al., 2022).

Given the high comorbidity between COMISA and depression—and its significant impact on mental and physical wellbeing—depression in individuals with COMISA is a critical clinical concern. It amplifies the negative effects of sleep disturbances, contributing to poorer health outcomes. Effectively addressing depression in this population is essential to improving quality of life and mitigating related health risks. Emerging evidence suggests that individuals with COMISA face a greater risk of depression, anxiety, stress, and reduced quality of life compared to those with either insomnia or OSA alone (Sweetman et al., 2019, 2017; Smith et al., 2004; Lang et al., 2017; Krakow et al., 2001; Yang et al., 2011).

The U.S. Preventive Services Task Force (USPSTF) recommends depression screening for all adults who have not been previously diagnosed with a mental health disorder. This screening is typically conducted using the self-reported Patient Health Questionnaire, 9 items (PHQ-9) (Sheehan et al., 1998), which the USPSTF recognizes as the gold standard for the screening of current major depressive episode (cMDE) (Maurer et al., 2018; US Preventive Services Task Force, 2002; Mulvaney-Day et al., 2018).

Current depression screening tools rely on self-report measures, which, despite their general accuracy, are less effective in sleep clinic settings (Lequerica et al., 2022). For instance, Items 3 and 4 of the PHQ-9 may overestimate sleep issues, inflating scores and increasing false positives (Lequerica et al., 2022; Kroenke and Spitzer, 2002). Item 9, assessing suicidal ideation, raises legal and ethical concerns for clinicians. Additionally, mismatches between the PHQ-9′s two-week validity and delays in polysomnography (PSG) due to insurance can reduce screening accuracy (Ma et al., 2021).

While structured clinical interviews offer more reliable diagnoses, they require timely administration by trained professionals, adding to already strained clinical and financial resources. As such, an automated PSG-based depression screener could provide a more efficient, accurate, and timely alternative for identifying depression in sleep disorder populations.

To address the limitations of existing depression screening tools, this study evaluated the screening performance of a novel software, MEB-001, for detecting cMDE in individuals with and without suspected COMISA referred for sleep studies at multiple U.S. sleep clinics. MEB-001 utilizes ECG signals, EEG bands, sleep macrostructure, and responses to PHQ-9 Items 1 and 2, collected during routine inpatient PSG. Specifically, we examined whether combining physiological features with core depressive symptom reports, analyzed through machine learning algorithms, can effectively screen for cMDE at a level comparable to or better than the most widely used depression screening tool, the PHQ-9, when benchmarked against a structured clinical diagnostic reference.

We expect that MEB-001 will demonstrate high accuracy in screening for cMDE, performing comparably to, or exceeding, the PHQ-9 when validated against a structured clinical diagnostic interview.

2 Methods

A retrospective sub-analysis was conducted on data from a prior single-arm, prospective, multicenter trial known as “Sleep Signal Analysis for Current Major Depressive Episode (SAMDE),” registered with ClinicalTrials.gov (NCT05708222). This study aimed to evaluate the screening efficacy of MEB-001 for cMDE in individuals exhibiting and not exhibiting suspected comorbid insomnia and sleep apnea (COMISA) across a cohort of United States sleep clinics. This study received approval from the WGC Institutional Review Board (212 Carnegie Center, Suite 301, Princeton, NJ 08540 Tracking No. 20223322), and was conducted under the principles of the United States Food and Drug Administration (https://www.regulations.gov/docket/FDA-2011-D-0567).

Participants undergoing inpatient polysomnography (PSG) for suspected primary or secondary sleep disorders were enrolled based on predefined inclusion and exclusion criteria. Inclusion criteria were: age between 22 and 75 years; undergoing PSG for suspected sleep disorders—including, but not limited to, sleep-related breathing disorders, sleep-related movement disorders, circadian rhythm sleep-wake disorders (intrinsic and extrinsic), hypersomnia, parasomnia, and insomnia; ability and willingness to provide informed consent; and capacity to follow study procedures and instructions. Exclusion criteria included the presence of a pacemaker, history of heart transplantation, or undergoing a full-night C-PAP titration study.

The recruitment process commenced on June 6, 2023, and concluded on July 9, 2024, across multiple sites within the United States. Three sleep clinics were located in Texas, three in North Carolina, two in South Carolina, and one in Minnesota and Ohio, respectively.

Upon enrollment at the sleep clinic and immediately before their polysomnography (PSG) study, subjects completed a self-administered questionnaire collecting demographic and clinical information, including age, sex, medical history, comorbidities, and current medications. Subsequently, each participant completed the self-administered PHQ-9 via an electronic data capture (EDC) system on a tablet, from which the two essential items for depression screening were derived, serving as the comparator screening test for the algorithm development. The digital platform automatically generated a score, adhering to the PHQ-9 scoring guidelines established by Kroenke, Spitzer, and Williams in 2001. In addition, always before the PSG study, a fully structured interview was also administered to each patient, utilizing the Mini International Neuropsychiatric Interview (MINI). This interview was conducted by a certified, MINI-trained nurse practitioner via teleconference and aimed to diagnose the cMDE, which served as the reference standard for evaluating the screening performance of MEB-001. The nurse practitioners who administered the MINI were blind to each participant's PHQ-9 score and the MEB-001 output.

2.1 Mini international neuropsychiatric interview

The MINI Adult Version (Sheehan et al., 1998) is a structured diagnostic interview designed to assess major psychiatric disorders based on the DSM-5 (American Psychiatric Association, 2022) and ICD-10 (International Classification of Diseases, 2024; World Health Organization, 1993).

Widely used in mental health settings worldwide, it provides a brief yet comprehensive tool for efficiently identifying conditions such as mood, anxiety, psychotic, and substance use disorders. The interview consists of standardized yes/no questions, allowing for rapid administration in clinical and research contexts. Each disorder is evaluated through specific modules, with questions aligned with DSM-5 criteria for accurate diagnosis.

Validation and reliability studies of the MINI demonstrated a sensitivity of 0.96, a specificity of 0.88, a positive predictive value of 0.87, a negative predictive value of 0.97, an inter-rater Kappa of 1.00, and a test/retest Kappa of 0.87 in the diagnosis of major depressive disorder (MDD) (Sheehan et al., 1998) when compared to clinician-administered semi-structured interviews, specifically the Structured Clinical Interview (SCID) (Spitzer et al., 1992).

Comparisons between the Mini-International Neuropsychiatric Interview (MINI) and unstructured interviews conducted by mood disorder specialists have demonstrated a sensitivity of 0.87, a specificity of 0.63, a positive predictive value of 0.84, a negative predictive value of 0.68, and a Cohen's kappa of 0.51 (Rodrigues et al., 2024). Furthermore, the MINI has exhibited robust inter-rater reliability, even when administered by individuals without specialized mental health expertise (Sheehan et al., 1998; Rodrigues et al., 2024; de Azevedo Marques and Zuardi, 2008).

2.2 Patient health questionnaire (PHQ-9)

The PHQ-9 (Spitzer et al., 1999; Kroenke et al., 2001) is a validated 9-item self-report instrument widely utilized in the evaluation of depression across major sociodemographic segments within the U.S., with no significant variations in performance observed among these groups.

The nine items correspond to the nine DSM-5 criteria for a major depressive episode, as defined by the American Psychiatric Association (2022). Response options for each item are scored on a scale from “not at all” (0) to “nearly every day” (Luyster et al., 2010), quantifying the frequency with which each symptom has affected the respondent within the preceding 2-week period. The score derived from the nine items ranges from 0 to 27. Scores of 5, 10, 15, and 20 are indicative of mild, moderate, moderately severe, and severe depression symptoms, respectively. A score of 10 or higher is considered a threshold, signifying a potentially clinically significant condition, specifically a possible cMDE.

A recent and extensive Individual Participant Data Meta-analysis (Negeri et al., 2021), encompassing approximately 44,500 participants, evaluated the accuracy of the PHQ-9 in identifying cMDE by comparison with established reference standards. Specifically, in comparison to semi-structured psychiatric diagnostic interviews, sensitivity and specificity for a cutoff of ≥ 10 (95% CI) were determined to be 0.88 (0.82–0.92) and 0.86 (0.82–0.88), respectively. In the context of fully structured interviews, sensitivity ranged from 0.67 (0.57–0.76) to 0.75 (0.66–0.82), while specificity ranged from 0.86 (0.80–0.90) to 0.88 (0.84–0.91). In general, the PHQ-9 demonstrated acceptable accuracy in detecting depressive episodes. Concurrently, a score of ≥ 10 has been associated with an increased risk of major depression by a factor exceeding 2.6 (Kroenke et al., 2001).

Based on the substantial body of scientific evidence regarding the PHQ-9, the application of this cut-off threshold (≥ 10) is recommended as the most reliable approach for screening purposes in clinical practice and clinical trials (He et al., 2020).

2.3 OSA, suspected insomnia, and suspected COMISA determination

OSA was diagnosed based on the final clinical assessment made by a sleep specialist following interpretation of polysomnography (PSG) results. Insomnia symptoms were evaluated using a self-administered clinical questionnaire completed prior to the sleep study. The questionnaire included four items addressing sleep difficulties: trouble initiating or falling asleep, frequent nocturnal awakenings, early morning awakenings, and difficulty returning to sleep after nighttime awakenings. Each item was rated on a four-point Likert scale (“Never,” “Sometimes,” “Often,” “Always”). Participants reporting at least one of these symptoms as occurring “Often” or “Always” were classified as having suspected insomnia. Suspected COMISA (co-morbid insomnia and sleep apnea) was identified when both OSA and suspected insomnia criteria were met. Because insomnia was assessed via self-report rather than clinical evaluation, the COMISA classification should be considered provisional.

The Insomnia Severity Index (ISI), commonly used to identify insomnia in population-based samples and evaluate treatment response, and the Epworth Sleepiness Scale (ESS), used to assess daytime sleepiness, were not administered. As no formal clinical diagnosis of insomnia was performed, both insomnia and COMISA were considered suspected rather than confirmed diagnoses.

2.4 European data format (EDF) files

The physiological data recorded during PSG were extracted as EDF files. These files comprised full-night digital PSG recordings, conducted in accordance with the American Academy of Sleep Medicine (AASM) guidelines (American Academy of Sleep Medicine, 2023) and acquired utilizing a variety of PSG systems, including the SomnoStar 10.2 Sleep Scoring System (Vyaire Medical, Inc.), Philips Respironics Sleepware (G3), Natus Neurology Sandman Elite (10.1), and Compumedics Profusion PSG4 (V4.5).

Of the multiple physiological channels recorded during PSG, six electroencephalography (EEG) montages (F4A1, C4A1, O2A1, F3A2, C3A2, and O1A2; minimum sampling rate: 200 Hz) and the lead-II electrocardiography (ECG) channel (minimum sampling rate: 200 Hz) were employed as input for the MEB-001 software. All EDF and associated files underwent de-identification through the removal of personally identifiable information and were assigned a sequential identification code prior to collection.

2.5 MEB-001 software

This study presents a novel technology for the objective screening of active depressive symptoms in sleep centers, utilizing EEG and electrocardiogram (ECG) signals derived from PSG, in conjunction with two key components of the PHQ-9, specifically items 1 and 2 (Figure 1).

Figure 1
Flowchart depicting the operation of MEB-001 Software. Inputs include EDF files from a 510(k) cleared medical device, PHQ-9 items 1 and 2, and patient ID. The software parses EEG and ECG signals, producing SEEG and SHRV data, which feed into NSSD and a report generator. Outputs are patient ID, test date, and cMDE results, indicating if criteria for current MDE are met, not met, or unable to assess.

Figure 1. MEB-001 software overview. The MEB-001 diagram illustrates the software inputs and processing, which integrate EEG and ECG signals from PSG with Items 1 and 2 from the PHQ-9 to screen for cMDE in sleep centers. The three processing modules, SEEG, SHRV, and NSSD, are responsible for sleep staging based on EEG signals, calculating heart rate and heart rate variability throughout sleep stages, and determining the final cMDE, respectively.

MEB-001, a software application currently in development, is designed to screen for cMDE in adult patients aged 22 to 75 years who are undergoing PSG for suspected sleep apnea. The software analyzes physiological data recorded during PSG, specifically six EEG derivations—three primary (F4A1, C4A1, O2A1) and three backup (F3A2, C3A2, O1A2)—as per the guidelines established by the AASM (American Academy of Sleep Medicine, 2023), and the lead-II ECG derivation. These signals are extracted from EDF files containing the complete physiological recordings from the PSG. A minimum sampling rate of 200 Hz is required for both EEG and ECG derivations. Furthermore, MEB-001 incorporates patient responses to PHQ-9 items 1 and 2, which are collected prior to the PSG. The software also necessitates the input of “lights-off” and “lights-on” times to delineate the initiation and termination of the sleep study.

The software commences its analysis with an automated sleep staging derived from six EEG channels. This module utilizes an automated sleep staging algorithm, which, as demonstrated in a prior investigation, exhibited substantial concordance with manual sleep staging conducted by independent sleep technologists (Grassi et al., 2023). The EEG signals undergo processing via a sequence of artificial intelligence, specifically machine learning and deep learning, algorithms that categorize each 30-s epoch into one of five sleep stages: wakefulness, N1, N2, N3, and REM. Concurrently, spectral analysis of the EEG data is executed to generate sleep macrostructure indices that depict sleep patterns observed throughout the polysomnography. Subsequent to sleep staging, the ensuing processing module incorporates ECG data to ascertain heart rate (HR) and heart rate variability (HRV) across discrete sleep stages. A range of HRV indices, encompassing time-domain, frequency-domain, and non-linear domain metrics, are computed for each respective stage.

In the final processing phase, select EEG and ECG-derived variables, previously obtained, are integrated with scores from PHQ-9 items 1 and 2. An ensemble of machine learning algorithms analyzes these inputs, subsequently generating a binary classification of cMDE status. In the event that EEG or ECG signals are significantly corrupted or deemed unusable, MEB-001 will produce an “Unable to calculate cMDE” output.

The machine learning algorithm employed in this module was trained using data acquired during Phase 1 of the Sleep Analysis Major of Depression Episode (SAMDE) study. Notably, the datasets utilized for the development and training of the machine learning algorithms, which constitute the various modules of MEB-001, were distinct from the dataset employed for testing in this particular study. The algorithms were finalized prior to the commencement of the study analysis and remained unaltered following exposure to the study data. Certain sleep clinics engaged in this study, specifically those located in Minnesota, Ohio, North Carolina, and South Carolina, also contributed to data acquisition during the software's developmental phase, whereas the clinics situated in Texas did not participate in such data collection.

2.6 Statistical analysis

The screening accuracy of the MEB-001 software and the PHQ-9, relative to the MINI, was evaluated using sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV). Sensitivity was defined as the proportion of true positives (TP) among all individuals with the condition, calculated as TP/(TP + FN). Specificity referred to the proportion of true negatives (TN) among individuals without the condition, calculated as TN/(TN + FP). The PPV was the proportion of true positives among all test-positive cases, calculated as TP/(TP + FP), while the NPV represented the proportion of true negatives among all test-negative cases, calculated as TN/(TN + FN). In these formulas, FN refers to false negatives and FP to false positives. Exact 95% confidence intervals (CIs) were computed using the Clopper–Pearson (exact binomial) method.

Analyses were conducted on the full sample and stratified by OSA, suspected COMISA, and suspected insomnia diagnoses groups. Fisher's exact test was employed to compare sensitivity, specificity, PPV, and NPV between subgroups. For each metric, 2 × 2 contingency tables were constructed separately for individuals with and without suspected COMISA, based on classifications from the screening tools relative to the MINI. Sensitivity comparisons assessed differences in the proportion of true positives among MINI-positive cases (TP + FN); specificity comparisons examined the proportion of true negatives among MINI-negative cases (TN + FP); PPV comparisons evaluated the proportion of true positives among test-positive cases (TP + FP); and NPV comparisons assessed the proportion of true negatives among test-negative cases (TN + FN).

Additionally, McNemar's test was used to compare the paired screening performance of MEB-001 and PHQ-9. The analysis was stratified by cMDE diagnosis according to the MINI, and discordant classifications between the two tools were examined among true positive and true negative cases. This analysis was also conducted separately within the two COMISA subgroups.

All statistical tests were two-sided, with a significance threshold set at p < 0.05. Analyses were performed using Python (version 3.12.1) (Python Software Foundation, 2024).

3 Results

A total of 395 participants satisfied the study's inclusion and exclusion criteria and provided informed consent. Of these, 29 individuals were excluded based on clinical considerations. Conversely, 40 individuals were excluded as a result of PSG-related technical deficiencies, including but not limited to suboptimal signal quality or the recording of less than 6 h of data. Consequently, a final sample of 326 subjects was deemed usable for the study. These 326 subjects were recruited from multiple sleep clinics throughout the United States, specifically: three clinics in Texas (n = 62, 60, 31), three in North Carolina (n = 28, 3, 2), two in South Carolina (n = 95, 32), one in Minnesota (n = 10), and one in Ohio (n = 3).

Thirty cases (9.23% of the processed data) received an “Unable to assess current cMDE” determination from MEB-001 due to significant signal anomalies. Specifically, issues were identified in EEG and sleep staging for 8 cases (2.46%) and in ECG and HR/HRV data for 22 cases (6.77%). As a result, these 30 cases were excluded from further analysis. The remaining 296 cases were used to evaluate the performance of MEB-001 (Figure 2). A detailed summary of their characteristics, including cMDE prevalence based on the MINI and PHQ-9 assessments, is provided in Table 1.

Figure 2
Flowchart depicting a study process. Initially, 395 subjects were recruited. Out of these, 69 were excluded due to clinical criteria (29) and PSG technical issues (40). Thus, 326 were analyzed with MEB-001. Subsequently, 30 more were excluded for being unable to calculate cMDE, resulting in 296 subjects included in the final analysis.

Figure 2. Study flowchart.

Table 1
www.frontiersin.org

Table 1. Descriptive statistics.

The participant cohort was evenly divided into two groups, with 148 participants exhibiting suspected COMISA and 148 participants not exhibiting suspected COMISA. A total of 46 subjects received a cMDE diagnosis according to the MINI, indicating a prevalence of 15.54% (95% CI: 11.61%−20.18%). Within the group with suspected COMISA, 25 subjects were diagnosed with cMDE, resulting in a prevalence of 16.89% (95% CI: 11.24%−23.92%). In the group without suspected COMISA, 21 subjects were diagnosed with cMDE, yielding a prevalence of 14.19% (95% CI: 9.00%−20.87%). Table 1 provides a comprehensive summary of the analyzed subjects' characteristics, both overall and stratified by suspected COMISA status.

The prevalence of cMDE based on software, PHQ-9 cut-off of 10, and MINI interviews in the total sample and in the subgroups is presented in Table 2, and the accuracy of MEB-001 in detecting cMDE was assessed in the entire sample and OSA, suspected insomnia, and suspected COMISA, diagnoses groups is presented in Tables 3, 4. No statistically significant differences were observed between the subgroups in sensitivity (p = 0.686), specificity (p = 0.675), PPV (p = 0.691), or NPV (p = 1.000).

Table 2
www.frontiersin.org

Table 2. Prevalence of cMDE based on different determination strategies.

Table 3
www.frontiersin.org

Table 3. Screening performance of MEB-001 and PHQ-9 (cut- off ≥ 10) relative to MINI.

Table 4
www.frontiersin.org

Table 4. Confusion matrices for MEB-001 and PHQ-9 (cut-off ≥ 10) in identifying current major depressive episode.

The accuracy of the PHQ-9 (cut-off ≥10) was evaluated across the full sample and subgroups (Table 3). No statistically significant differences were observed between the subgroups in sensitivity (p = 1.000), specificity (p = 0.176), PPV (p = 0.849), or NPV (p = 0.665).

The paired screening performance of MEB-001 and PHQ-9 was directly compared using McNemar's test across the full sample and within COMISA subgroups.

In the full sample, no significant differences were observed between MEB-001 and PHQ-9 in the proportion of correctly identified cases of current major depressive episode (cMDE) for participants with cMDE (p = 0.414) or without cMDE (p = 0.198), as determined by the MINI.

Within the subgroup of participants with suspected COMISA, the proportion of correctly classified individuals also did not significantly differ for those with cMDE (p = 1.000) or without cMDE (p = 0.144).

Similarly, in the non-COMISA subgroup, no significant differences were found in the correct identification of participants with cMDE (p = 0.317) or without cMDE (p = 0.818).

Table 5 displays the agreement and disagreement rates between MEB-001 and PHQ-9, considering both the entire sample and each of the two COMISA subgroups.

Table 5
www.frontiersin.org

Table 5. Agreement and disagreement rates between MEB-001 and PHQ-9 (cut-off ≥ 10) for detecting current major depressive episode.

4 Discussion

This study evaluated the performance of a novel software, MEB-001, which combines physiological features with core depressive symptom reports and applies ML algorithms to detect cMDE in individuals with and without suspected COMISA referred for sleep studies at multiple U.S. clinics. Its screening accuracy was compared to the widely used PHQ-9, using a structured clinical diagnostic interview as the reference standard.

According to our results, a cMDE was diagnosed in 15.54% of the sample based on MINI interviews, with similar prevalence rates observed among participants with suspected COMISA (16.89%) and those without (14.19%). These rates are higher than those reported in general population studies, where point prevalence estimates for major depression typically range from 8 to 13% (Weinberger et al., 2018; Brody et al., 2018), indicating a heightened risk of depression within sleep clinic populations and underscoring the critical need for systematic screening in these settings. In contrast, screening tools indicated higher prevalence rates: the MEB-001 identified 36.82% of the total participants as having cMDE, with 34.46 and 39.19% in the respective subgroups. Similarly, using the PHQ-9 with a cut-off score of 10 or higher, 40.54% of participants screened positive for cMDE, including 36.49 and 44.59% in the two subgroups. It is well recognized that screening tools tend to overestimate the prevalence of conditions like cMDE compared to structured diagnostic interviews. This inflation occurs because screening instruments are designed to be highly sensitive to identify potential cases but may lack specificity, resulting in false positives. Consistent with this, both MEB-001 and PHQ-9 (with a cutoff ≥10) demonstrated high sensitivity (MEB-001: 84.8%; PHQ-9: 89.1%) but comparatively lower specificity (MEB-001: 72%; PHQ-9: 68.4%). Therefore, prevalence rates obtained through screening are generally higher than those derived from diagnostic assessments. This pattern aligns with previous research highlighting the tendency of screening measures to produce elevated prevalence estimates in epidemiological studies (Levis et al., 2019). Along with a higher sensitivity, a clinically effective screening instrument is characterized by a strong NPV, which indicates its ability to reliably rule out the condition when the test result is negative (Ben-Haim and Dacso, 2024; Trevethan, 2017). Both the MEB-001 and PHQ-9 demonstrated comparable NPVs (96.3 and 97.2% respectively) compared to the MINI-based diagnosis of cMDE, underscoring their utility in accurately identifying individuals without the disorder.

Consistent with our findings, a recent study demonstrated that in OSA patients, the coexistence of both insomnia symptoms and daytime sleepiness defines a distinct phenotype associated with a higher prevalence and greater severity of depressive symptoms. This work underscores the clinical heterogeneity of OSA and suggests that comorbid insomnia may amplify the risk of depression, highlighting the critical need for comprehensive symptom assessment to improve depression management in this population (Gabryelska et al., 2024a).

Recent evidence suggests a biological link between circadian clock dysregulation and depressive symptoms in individuals with OSA. In a study assessing circadian gene expression and depression severity using the Montgomery–Åsberg Depression Rating Scale (MADRS), a significant positive correlation was observed between morning expression levels of several circadian genes and the severity of depressive symptoms in OSA patients. Among these, morning expression of the PER1 gene emerged as an independent predictor of depression severity, even after adjusting for insomnia and chronotype-related variables. These findings support the hypothesis that disrupted circadian regulation may contribute to the pathophysiology of depression in OSA, offering a potential mechanistic explanation for the high comorbidity observed between these conditions (Gabryelska et al., 2024b).

Our findings herein demonstrate that both the MEB-001 and the PHQ-9 are robust screening instruments for cMDE in sleep clinic settings and exhibit comparable efficacy across diverse subgroups, encompassing both individuals with and those without suspected COMISA.

While both MEB-001 and PHQ-9 are valuable tools for screening cMDE, MEB-001 may outperform PHQ-9 in certain contexts due to its unique approach. Unlike the PHQ-9, which relies solely on self-reported symptoms, MEB-001 combines physiological features with core depressive symptom reports and applies ML algorithms for detection. This multi-dimensional approach helps mitigate limitations of self-report measures, such as underreporting due to stigma or social desirability bias. As a result, MEB-001 may offer more accurate detection in populations where mental health stigma is a barrier to honest disclosure. Additionally, by incorporating objective physiological data, MEB-001 can enhance screening accuracy in settings where literacy or familiarity with mental health concepts is limited, reducing misinterpretation common in traditional questionnaires. The use of objective, automated assessment may also help reduce legal and clinical risk—particularly related to responses to item 9 of the PHQ-9, which addresses suicidal ideation and may require urgent follow-up based on subjective interpretation. Furthermore, MEB-001 can lower the burden on clinicians by supporting efficient, standardized assessments. Therefore, MEB-001 offers distinct advantages in clinical and community contexts that demand sensitive, reliable, and scalable tools for the screening of depressive episodes.

It is noteworthy that the PPVs for both MEB-001 and PHQ-9 ≥10 were relatively low, at 35.8 and 34.2%, respectively. It is essential to acknowledge that the prevalence of a condition within the study population has a direct effect on the PPV. A diminished PPV suggests a higher incidence of false-positive results. When prevalence is low, even a highly specific test yields relatively few TP compared with FP, lowering PPV. In our sample (46 cMDE cases among 296 participants), this scarcity of true cases likely accounts for the modest PPV observed.

No significant differences were found in the screening performance of either the MEB-001 software or the PHQ-9 (cut-off ≥10) between subgroups (OSA, suspected insomnia diagnoses, and suspected COMISA). Both tools showed comparable sensitivity, specificity, PPV, and NPV across the groups. Additionally, direct comparisons of classification accuracy using McNemar's test revealed no significant differences between MEB-001 and PHQ-9 in the overall sample or within the subgroups. Our results showed that MEB-001 was juxtaposed with the PHQ-9 within subjects with or without suspected COMISA.

In literature, some studies have used ML to achieve the identification of subjects with depression using HR and HRV metrics and the input of the algorithm. For example, Shaw et al. used single-lead ECG recordings to differentiate individuals with comorbid OSA and depression from those with OSA without depression, achieving with this model an accuracy of 78.18%, sensitivity of 73.91%, specificity of 81.25%, and precision of 73.91% (Shaw et al., 2024). Another study analyzed 5-min HRV segments from sleep-stage ECG recordings in 40 MDD patients and 40 controls, reporting 85.9% sensitivity and 86.5% specificity using a Bayesian-optimized ensemble of extremely randomized trees (Geng et al., 2023). While these studies suggest the potential for identifying autonomic signatures of depression during sleep, they have notable limitations: both relied on the PHQ-9 for depression classification rather than structured diagnostic interviews, which are considered the gold standard; used cross-validation exclusively instead of independent test sets; and involved relatively small sample sizes.

Our study addresses these limitations by employing MINI-structured interviews as the reference standard, evaluating performance on an independent hold-out test set, and utilizing a significantly larger sample size, thereby enhancing the robustness and generalizability of our findings.

In addition to clinical utility, the potential cost-effectiveness of MEB-001 and its implications for resource allocation in sleep clinics warrant consideration. As an automated tool, MEB-001 could reduce the need for additional clinical staff time and follow-up appointments by streamlining depression screening within existing workflows, which is particularly valuable in resource-constrained settings. Its scalability may also support broader mental health integration into sleep medicine without imposing significant operational burdens. Looking ahead, future research should focus on the real-world deployment of MEB-001, including prospective validation in diverse clinical settings. Studies evaluating its long-term impact on patient outcomes, workflow efficiency, and healthcare resource utilization will be essential to fully establish its value and inform adoption in routine practice.

5 Limitations

Several methodological features of our study may help explain this discrepancy and the observed lack of a significant difference between groups. First, the diagnosis of COMISA in our study was based on a suspected rather than a confirmed clinical classification of insomnia. Insomnia symptoms were assessed via a brief self-reported questionnaire embedded in the intake form, without using validated instruments such as the ISI. Consequently, the insomnia component of COMISA may have been under- or misclassified, potentially introducing misclassification bias and diluting the observed differences in cMDE prevalence between groups. Future studies employing standardized and validated insomnia assessment tools could provide more accurate classification and strengthen the findings. Second, the broad inclusion criteria—encompassing a wide range of sleep disorders—may have contributed to a more heterogeneous sample, reducing the ability to detect contrasts specifically attributable to COMISA. Unlike studies that compare COMISA specifically to pure OSA or pure insomnia groups, our sample likely included individuals with overlapping or ambiguous sleep symptoms, further minimizing distinctions. Third, the use of MINI as the diagnostic reference for depression, while robust in validity, captures only categorical diagnoses of cMDE and may miss subthreshold depressive symptoms. Fourth, it is worth considering that both groups—COMISA and non-COMISA—may share common pathways of sleep fragmentation, circadian disruption, or fatigue that independently contribute to depressive risk, thereby flattening differences between them in this particular sample. Fifth, MEB-001 software was unable to provide a cMDE determination for roughly 9% of the subjects. This limitation could result in a small number of patients not receiving algorithm-based screening in a practical setting. Finally, although our analysis revealed no significant differences in performance between MEB-001 and PHQ-9, either in the full sample or within subgroups, this result may partly be attributable to the limited sample size and consequent reduction in statistical power.

6 Conclusion

Our results highlight the need for depression screening in the SCs. The failure of clinicians to identify acute depressive symptoms concurrent with sleep disturbances produces adverse clinical consequences, resulting in postponed diagnoses, management, and treatment of mental health disorders, notably MDD (Substance Abuse and Mental Health Services Administration, 2022). The absence of uniform mental health screening instruments in sleep clinics increases the probability of misdiagnosing and managing comorbid sleep disorders such as COMISA, which, due to the high prevalence of cMDE, can negatively impact treatment protocols and patient outcomes (Trevethan, 2017; Daccò et al., 2023).

Prior investigations and established guidelines advocate for the implementation of routine depression screening within sleep clinics to mitigate this concern. Consequently, certain authorities have underscored the necessity of standardized depression screening in these settings (Trevethan, 2017; Rosenberg, 2003). Conversely, prevailing guidelines stipulate psychological screening only for particular sleep-wake disorders, such as insomnia, rather than as a universally applied procedure (Trevethan, 2017; Rosenberg, 2003). However, a significant amount of evidence suggests that protracted periods of untreated depression and delayed administration of antidepressant therapy are correlated with suboptimal prognoses. Moreover, the inappropriate prescription of hypnotics and sedatives for the management of sleep disturbances in individuals with depression has been associated with an increased incidence of depression exacerbation and recurrence (Geng et al., 2023; Inada et al., 2021).

In light of the substantial prevalence of depressive, anxious, and stress-related symptomatology among individuals with COMISA (Smith et al., 2004; Lang et al., 2017), along with the complexities of its diagnosis, management, and treatment (Sweetman et al., 2019, 2017; Inada et al., 2021), a robust recommendation for depressive symptom screening is hereby asserted. The detection of these symptoms facilitates the capacity of clinicians to ascertain the optimal therapeutic regimen, thereby augmenting the efficacy of sleep and mental health management.

As previously discussed, traditional depression screening tools—such as self-report questionnaires—have played a valuable role in addressing the limitations of relying exclusively on clinical judgment, particularly when assessments are conducted by non-specialist providers. However, these tools also present well-documented limitations. They are vulnerable to subjective biases, such as the underreporting or minimization of symptoms, fluctuations in patient engagement or motivation, and the risk of incomplete or inaccurate responses. While generally less time-consuming than structured clinical interviews, self-report instruments still require clinicians to allocate time to explain the rationale for the questionnaire and provide clear instructions, which can pose a practical burden in high-demand clinical environments.

In light of these challenges, integrating automated depression screening into routine care has received growing support. Reliable tools that incorporate objective physiological signals offer a promising alternative to traditional self-report methods. By reducing subjective bias, improving diagnostic consistency, and easing clinician workload, such systems can enhance early detection and management of depression. Ultimately, embedding automated, evidence-based tools into clinical workflows may promote more efficient, equitable, and scalable mental health care. By reducing reliance on clinician-administered assessments, such tools can streamline screening processes and enable faster triage of at-risk individuals. The use of objective physiological data also promotes more consistent and standardized evaluations, minimizing subjectivity and helping to reach individuals who may underreport symptoms due to stigma or low mental health literacy. Furthermore, the scalability of automated systems supports broader implementation across diverse clinical settings, including those with limited access to mental health professionals. Together, these advantages position MEB-001 as a potentially valuable tool for enhancing depression detection and addressing mental health needs more effectively within routine medical care.

An important future direction involves the integration of MEB-001 into electronic health records and existing PSG systems. Embedding the tool directly into clinical workflows could enable prompt analysis of physiological and sleep data, with automated depression risk assessments delivered alongside standard PSG reports. Such integration would facilitate timely clinical decision-making, streamline documentation, and support coordinated care by making mental health screening results readily accessible to both sleep specialists and referring providers. Ultimately, interoperability with electronic health records and PSG platforms could enhance the scalability and impact of digital screening tools like MEB-001 in routine sleep medicine practice.

To conclude, the present analysis demonstrates that a novel software application, MEB-001—integrating both subjective and objective physiological data—offers an effective approach for screening cMDE in sleep clinic settings. The MEB-001 software showed comparable efficacy across different patient subgroups, including those with and without suspected COMISA, and performed on par with the PHQ-9, the current gold standard for depression screening in primary care and specialty settings. These findings support the potential utility of automated screening tools to facilitate the timely, efficient, and scalable detection of depression in sleep medicine populations, thereby addressing existing barriers to mental health assessment in these clinical environments. However, we recognize the critical importance of independent, external replication to validate the findings presented here. Future studies led by independent researchers are essential to confirm the robustness, generalizability, and clinical utility of MEB-001, ensuring unbiased evaluation and facilitating broader acceptance within the scientific and medical communities.

Data availability statement

The datasets presented in this article are not readily available because the dataset is Medibio Limited property. Requests to access the datasets should be directed to bWFzc2ltaWxpYW5vLmdyYXNzaUBtZWRpYmlvLmNvbS5hdQ==.

Ethics statement

The SAMDE study, which involved humans, was approved by WGC Institutional Review Board (212 Carnegie Center, Suite 301, Princeton, NJ 08540 Tracking No. 20223322). The study was conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.

Author contributions

AD: Writing – original draft, Conceptualization, Formal analysis, Funding acquisition, Methodology. MG: Conceptualization, Data curation, Formal analysis, Methodology, Software, Validation, Writing – original draft, Writing – review & editing. SD: Data curation, Writing – original draft, Writing – review & editing. JM: Writing – review & editing. DC: Writing – review & editing. GP: Writing – review & editing.

Funding

The author(s) declare that financial support was received for the research and/or publication of this article. Medibio Limited sponsored the costs associated with data acquisition and the personnel involved in data collection for this study. JM was supported by a VA HSRD Research Career Scientist Award (RCS 20-101; PI: Martin).

Acknowledgments

We thank Ernie Tursich, MS, Regulatory Affairs and Services, for his contributions to the research through editing, graphics, and table design.

Conflict of interest

AD, MG, and SD were employed by Medibio Limited. JM has received research funding from Medibio Limited for a separate study.

The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that Gen AI was used in the creation of this manuscript. The author(s) used OpenAI's ChatGPT to assist with English language editing and copyediting of this manuscript. The tool was not used for other purposes.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

American Academy of Sleep Medicine. (2023). The AASM Manual for the Scoring of Sleep and Associated Events: Rules, Terminology and Technical Specifications. Version 3. Darien, IL: American Academy of Sleep Medicine.

PubMed Abstract | Google Scholar

American Psychiatric Association. (2022). Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition, Text Revision (DSM-5-TR). Washington, DC, USA: American Psychiatric Publishing. doi: 10.1176/appi.books.9780890425787

PubMed Abstract | Crossref Full Text | Google Scholar

Andrade, L., Caraveo-Anduaga, J. J., Berglund, P., Bijl, R. V., De Graaf, R., Vollebergh, W., et al. (2003). The epidemiology of major depressive episodes: results from the international consortium of psychiatric epidemiology (ICPE) surveys. Int. J. Methods Psychiatr. Res. 12:3e21. doi: 10.1002/mpr.138

PubMed Abstract | Crossref Full Text | Google Scholar

Araujo, T., Jarrin, D. C., Leanza, Y., Vallieres, A., and Morin, C. M. (2017). Qualitative studies of insomnia: current state of knowledge in the field. Sleep Med. Rev. 31, 58–69. doi: 10.1016/j.smrv.2016.01.003

PubMed Abstract | Crossref Full Text | Google Scholar

Ben-Haim, Y., and Dacso, C. C. (2024). Interpreting PPV and NPV of diagnostic tests with uncertain prevalence. Rambam Maimonides Med. J. 15:e0013. doi: 10.5041/RMMJ.10527

PubMed Abstract | Crossref Full Text | Google Scholar

Brody, D. J., Pratt, L. A., and Hughes, J. P. (2018). Prevalence of depression among adults aged 20 and over: United States, 2013–2016. NCHS Data Brief 303, 1–8.

Google Scholar

Cheung, J. M., Atternäs, K., Melchior, M., Marshall, N. S., Fois, R. A., and Saini, B. (2014). Primary health care practitioner perspectives on the management of insomnia: a pilot study. Aust. J. Prim. Health. 20, 103–112. doi: 10.1071/PY12021

PubMed Abstract | Crossref Full Text | Google Scholar

Daccò, S., Caldirola, D., Grassi, M., Alciati, A., Perna, G., and Defillo, A. (2023). High prevalence of major depression in US sleep clinics: the need for routine depression screening in sleep services. J. Clin. Sleep Med. 19, 835–836. doi: 10.5664/jcsm.10398

PubMed Abstract | Crossref Full Text | Google Scholar

de Azevedo Marques, J. M., and Zuardi, A. W. (2008). Validity and applicability of the mini international neuropsychiatric interview administered by family medicine residents in primary health care in Brazil. Gen. Hosp. Psychiatry 30, 303–310. doi: 10.1016/j.genhosppsych.2008.02.001

PubMed Abstract | Crossref Full Text | Google Scholar

Gabryelska, A., Turkiewicz, S., Białasiewicz, P., Grzybowski, F., Strzelecki, D., and Sochal, M. (2024a). Evaluation of daytime sleepiness and insomnia symptoms in OSA patients with a characterization of symptom-defined phenotypes and their involvement in depression comorbidity-a cross-sectional clinical study. Front. Psychiatry 15:1303778. doi: 10.3389/fpsyt.2024.1303778

PubMed Abstract | Crossref Full Text | Google Scholar

Gabryelska, A., Turkiewicz, S., Kaczmarski, P., Gajewski, A., Białasiewicz, P., Strzelecki, D., et al. (2024b). Circadian clock dysregulation: a potential mechanism of depression in obstructive sleep apnea patients. Transl. Psychiatry 14:423. doi: 10.1038/s41398-024-03134-0

PubMed Abstract | Crossref Full Text | Google Scholar

Gagnadoux, F., Le Vaillant, M., Paris, A., Pigeanne, T., Leclair-Visonneau, L., Bizieux-Thaminy, A., et al. (2016). Relationship between OSA clinical phenotypes and CPAP treatment outcomes. Chest 149, 288–290. doi: 10.1016/j.chest.2015.09.032

PubMed Abstract | Crossref Full Text | Google Scholar

Geng, D., An, Q., Fu, Z., Wang, C., and An, H. (2023). Identification of major depression patients using machine-learning models based on heart-rate variability during sleep stages for pre-hospital screening. Comput. Biol. Med. 162:107060. doi: 10.1016/j.compbiomed.2023.107060

PubMed Abstract | Crossref Full Text | Google Scholar

Grassi, M., Daccò, S., Caldirola, D., Perna, G., Schruers, K., and Defillo, A. (2023). Enhanced sleep staging with artificial intelligence: a validation study of new software for sleep scoring. Front. Artif. Intell. 6:1278593. doi: 10.3389/frai.2023.1278593

PubMed Abstract | Crossref Full Text | Google Scholar

Harris, M., Glozier, N., Ratnavadivel, R., and Grunstein, R. R. (2009). Obstructive sleep apnea and depression. Sleep Med. Rev. 13:437e44. doi: 10.1016/j.smrv.2009.04.001

PubMed Abstract | Crossref Full Text | Google Scholar

He, C., Levis, B., Riehm, K. E., Saadat, N., Levis, A. W., Azar, M., et al. (2020). The accuracy of the patient health questionnaire-9 algorithm for screening to detect major depression: an individual participant data meta-analysis. Psychother. Psychosom. 89, 25–37. doi: 10.1159/000502294

PubMed Abstract | Crossref Full Text | Google Scholar

Inada, K., Enomoto, M., Yamato, K., Marumoto, T., Takeshima, M., and Mishima, K. (2021). Effect of residual insomnia and use of hypnotics on relapse of depression: a retrospective cohort study using a health insurance claims database. J. Affect. Disord. 281, 539–546. doi: 10.1016/j.jad.2020.12.040

PubMed Abstract | Crossref Full Text | Google Scholar

International Classification of Diseases. (2024). USA version. Tenth Edition. Available online at: www.cdc.gov/nchs/icd/icd-10-cm/index.html (Accessed April 4, 2025).

Google Scholar

Kalra, S., Nagaraja, N., and Kalra, D. (2024). Increased risk of suicidal ideation and/or self-harm in co-morbid insomnia and obstructive sleep apnea (COMISA). Sleep Epidemiol. 4:100088. doi: 10.1016/j.sleepe.2024.100088

Crossref Full Text | Google Scholar

Koffel, E., Bramoweth, A. D., and Ulmer, C. S. (2018). Increasing access to and utilization of cognitive behavioral therapy for insomnia (CBT-I): a narrative review. J. Gen. Intern. Med. 33, 955–962. doi: 10.1007/s11606-018-4390-1

PubMed Abstract | Crossref Full Text | Google Scholar

Krakow, B., Melendrez, D., Ferreira, E., Clark, J., Warner, T. D., Sisley, B., et al. (2001). Prevalence of insomnia symptoms in patients with sleep-disordered breathing. Chest 120, 1923–1929. doi: 10.1378/chest.120.6.1923

PubMed Abstract | Crossref Full Text | Google Scholar

Kroenke, K., and Spitzer, R. L. (2002). The PHQ-9: a new depression diagnostic and severity measure. Psychiatric Ann. 32, 509–515. doi: 10.3928/0048-5713-20020901-06

Crossref Full Text | Google Scholar

Kroenke, K., Spitzer, R. L., and Williams, J. B. (2001). The PHQ-9: validity of a brief depression severity measure. J. Gen. Intern. Med. 16, 606–613. doi: 10.1046/j.1525-1497.2001.016009606.x

PubMed Abstract | Crossref Full Text | Google Scholar

Lang, C. J., Appleton, S. L., Vakulin, A., McEvoy, R. D., Wittert, G. A., Martin, S. A., et al. (2017). Co-morbid OSA and insomnia increase depression prevalence and severity in men. Respirology 22, 1407–1415. doi: 10.1111/resp.13064

PubMed Abstract | Crossref Full Text | Google Scholar

Lequerica, A. H., Watson, E., Dijkers, M. P., Goldin, Y., Hoffman, J. M., Niemeier, J. P., et al. (2022). The utility of the patient health questionnaire (PHQ-9) sleep disturbance item as a screener for insomnia in individuals with moderate to severe traumatic brain injury. J. Head Trauma Rehabil. 37, E383–E389. doi: 10.1097/HTR.0000000000000746

PubMed Abstract | Crossref Full Text | Google Scholar

Levis, B., Benedetti, A., and Thombs, B. D. (2019). DEPRESsion screening data (DEPRESSD) collaboration. accuracy of patient health questionnaire-9 (PHQ-9) for screening to detect major depression: individual participant data meta-analysis. BMJ 365:l1476. doi: 10.1136/bmj.l1476

PubMed Abstract | Crossref Full Text | Google Scholar

Luo, X., Li, S., Wu, Q., Xu, Y., Fang, R., Cheng, Y., et al. (2025). Depressive, anxiety, and sleep disturbance symptoms in patients with obstructive sleep apnea: a network analysis perspective. BMC Psychiatry 25:77. doi: 10.1186/s12888-025-06532-w

PubMed Abstract | Crossref Full Text | Google Scholar

Luyster, F. S., Buysse, D. J., and Strollo, P. J. (2010). Comorbid insomnia and obstructive sleep apnea: challenges for clinical practice and research. J. Clin. Sleep Med. 6, 196–204. doi: 10.5664/jcsm.27772

Crossref Full Text | Google Scholar

Ma, S., Yang, J., Yang, B., Kang, L., Wang, P., Zhang, N., et al. (2021). The patient health questionnaire-9 vs. the Hamilton rating scale for depression in assessing major depressive disorder. Front Psychiatry 12:747139. doi: 10.3389/fpsyt.2021.747139

PubMed Abstract | Crossref Full Text | Google Scholar

Maurer, D. M., Raymond, T. J., and Davis, B. N. (2018). Depression: screening and diagnosis. Am. Fam. Physician 98, 508–515.

Google Scholar

Mulvaney-Day, N., Marshall, T., Downey Piscopo, K., Korsen, N., Lynch, S., Karnell, L. H., et al. (2018). Screening for behavioral health conditions in primary care settings: a systematic review of the literature. J. Gen. Intern. Med. 33, 335–346. doi: 10.1007/s11606-017-4181-0

PubMed Abstract | Crossref Full Text | Google Scholar

Mysliwiec, V., Brock, M. S., Pruiksma, K. E., Straud, C. L., Taylor, D. J., Hansen, S., et al. (2022). A comprehensive evaluation of insomnia, obstructive sleep apnea and comorbid insomnia and obstructive sleep apnea in US military personnel. Sleep 45:zsac203. doi: 10.1093/sleep/zsac203

PubMed Abstract | Crossref Full Text | Google Scholar

Negeri, Z. F., Levis, B., Sun, Y., He, C., Krishnan, A., Wu, Y., et al. (2021). Depression Screening Data (DEPRESSD) PHQ Group. Accuracy of the Patient Health Questionnaire-9 for screening to detect major depression: updated systematic review and individual participant data meta-analysis. BMJ 375:n2183. doi: 10.1136/bmj.n2183

PubMed Abstract | Crossref Full Text | Google Scholar

Python Software Foundation (2024). Python Language Reference, Version 3.12, 1. Available online at: https://www.python.org/ (Accessed March 7, 2025).

PubMed Abstract | Google Scholar

Qaseem, A., Kansagara, D., Forciea, M. A., Cooke, M., and Denberg, T. D. (2016). Management of chronic insomnia disorder in adults: a clinical practice guideline from the American College of Physicians. Ann. Intern. Med. 165, 125–133. doi: 10.7326/M15-2175

PubMed Abstract | Crossref Full Text | Google Scholar

Rodrigues, G. S., Pellini, G. L., Rocha, R., Lima, A. F., Pio de Almeida Fleck, M., Panzenhagen, A. C., et al. (2024). Diagnosis of bipolar and major depressive disorders: the appropriateness of MINI compared to the clinical interview in a sample of patients with mood disorders in tertiary mental health care. J. Psychiatr Res. 169, 341–346. doi: 10.1016/j.jpsychires.2023.11.012

PubMed Abstract | Crossref Full Text | Google Scholar

Rosenberg, R. S. (2003). Depression in the sleep center: are we treating the whole patient? Sleep Med. 4:269. doi: 10.1016/S1389-9457(03)00056-X

PubMed Abstract | Crossref Full Text | Google Scholar

Shaw, V., Ngo, Q. C., Pah, N. D., Oliveira, G., Khandoker, A. H., Mahapatra, P. K., et al. (2024). Screening major depressive disorder in patients with obstructive sleep apnea using single-lead ECG recording during sleep. Health Inf. J. 2024:30. doi: 10.1177/14604582241300012

PubMed Abstract | Crossref Full Text | Google Scholar

Sheehan, D. V., Lecrubier, Y., Sheehan, K. H., Amorim, P., Janavs, J., Weiller, E., et al. (1998). The mini-international neuropsychiatric interview (M.I.N.I.): the development and validation of a structured diagnostic psychiatric interview for DSM-IV and ICD-10. J. Clin. Psychiatry 59 Suppl 20, 22–33; quiz 34–57.

PubMed Abstract | Google Scholar

Smith, S., Sullivan, K., Hopkins, W., and Douglas, J. (2004). Frequency of insomnia report in patients with obstructive sleep apnoea hypopnea syndrome (OSAHS). Sleep Med. 5:449–456. doi: 10.1016/j.sleep.2004.03.005

PubMed Abstract | Crossref Full Text | Google Scholar

Spitzer, R. L., Kroenke, K., and Williams, J. B. (1999). Validation and utility of a self-report version of PRIME-MD: the PHQ primary care study. Primary care evaluation of mental disorders. Patient health questionnaire. JAMA 282, 1737–1744. doi: 10.1001/jama.282.18.1737

PubMed Abstract | Crossref Full Text | Google Scholar

Spitzer, R. L., Williams, J. B., Gibbon, M., and First, M. B. (1992). The structured clinical interview for DSM-III-R (SCID). I: history, rationale, and description. Arch. Gen. Psychiatry 49, 624–629. doi: 10.1001/archpsyc.1992.01820080032005

PubMed Abstract | Crossref Full Text | Google Scholar

Substance Abuse and Mental Health Services Administration (2022). Key Substance Use and Mental Health Indicators in the United States: Results From the 2021 National Survey on Drug Use and Health (HHS Publication No. PEP22-07-01-005, NSDUH Series H-57). Rockville, MD: Center for Behavioral Health Statistics and Quality, Substance Abuse and Mental Health Services Administration.

Google Scholar

Sweetman, A., Lack, L., and Bastien, C. (2019). Co-morbid insomnia and sleep apnea (COMISA): prevalence, consequences, methodological considerations, and recent randomized controlled trials. Brain Sci. 9:371. doi: 10.3390/brainsci9120371

PubMed Abstract | Crossref Full Text | Google Scholar

Sweetman, A. M., Lack, L. C., Catcheside, P. G., Antic, N. A., Chai-Coetzer, C. L., Smith, S. S., et al. (2017). Developing a successful treatment for co-morbid insomnia and sleep apnoea. Sleep Med. Rev. 33, 28–38. doi: 10.1016/j.smrv.2016.04.004

PubMed Abstract | Crossref Full Text | Google Scholar

Trevethan, R. (2017). Sensitivity, specificity, and predictive values: foundations, pliabilities, and pitfalls in research and practice. Front. Public Health 5:307. doi: 10.3389/fpubh.2017.00307

PubMed Abstract | Crossref Full Text | Google Scholar

US Preventive Services Task Force. (2002). Screening for depression: recommendations and rationale. Ann. Intern. Med. 136, 760–764. doi: 10.7326/0003-4819-136-10-200205210-00012

PubMed Abstract | Crossref Full Text | Google Scholar

Weinberger, A. H., Gbedemah, M., Martinez, A. M., Nash, D., Galea, S., and Goodwin, R. D. (2018). Trends in depression prevalence in the USA from 2005 to 2015: widening disparities in vulnerable groups. Psychol. Med. 48, 1308–1315. doi: 10.1017/S0033291717002781

PubMed Abstract | Crossref Full Text | Google Scholar

World Health Organization. (1993). The ICD-10 classification of mental and behavioural disorders: Diagnostic criteria for research. World Health Organization.

Google Scholar

Yang, C. M., Liao, Y. S., Lin, C. M., Chou, S. L., and Wang, E. N. (2011). Psychological and behavioral factors in patients with comorbid obstructive sleep apnea and insomnia. J Psychosom. Res. 70, 355–361. doi: 10.1016/j.jpsychores.2010.12.005

PubMed Abstract | Crossref Full Text | Google Scholar

Ye, L., Pien, G. W., Ratcliffe, S. J., Björnsdottir, E., Arnardottir, E. S., Pack, A. I., et al. (2013). The different clinical faces of obstructive sleep apnoea: a cluster analysis. Eur. Res. J. 44, 1600–1607. doi: 10.1183/09031936.00032314

PubMed Abstract | Crossref Full Text | Google Scholar

Zhang, Y., Ren, R., Lei, F., Zhou, J., Zhang, J., Wing, Y. K., et al. (2019). Worldwide and regional prevalence rates of co-occurrence of insomnia and insomnia symptoms with obstructive sleep apnea: a systematic review and meta-analysis. Sleep Med. Rev. 45, 1–17. doi: 10.1016/j.smrv.2019.01.004

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: obstructive sleep apnea (OSA), insomnia, comorbid insomnia and sleep apnea (COMISA), current major depressive episode (cMDE), patient health questionnaire-9 (PHQ-9), mini-international neuropsychiatric interview, depression

Citation: Defillo A, Grassi M, Daccò S, Martin JL, Caldirola D and Perna G (2025) Performance evaluation of a novel screening tool for depressive episodes in COMISA: a comparative analysis with standard neuropsychometric assessments. Front. Sleep 4:1648256. doi: 10.3389/frsle.2025.1648256

Received: 16 June 2025; Accepted: 07 August 2025;
Published: 11 September 2025.

Edited by:

Ambrose A. Chiang, Case Western Reserve University, United States

Reviewed by:

Axel Steiger, Ludwig Maximilian University of Munich, Germany
Agata Gabryelska, Medical University of Lodz, Poland
Mustafa Kursat Sahin, Ondokuz Mayis University, Türkiye

Copyright © 2025 Defillo, Grassi, Daccò, Martin, Caldirola and Perna. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Archie Defillo, YXJjaGllLmRlZmlsbG9AbWVkaWJpby5jb20uYXU=

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.