
ORIGINAL RESEARCH article

Front. Psychiatry, 10 November 2025

Sec. Digital Mental Health

Volume 16 - 2025 | https://doi.org/10.3389/fpsyt.2025.1720990

This article is part of the Research Topic "eHealth and Personalized Medicine in Mental Health and Neurodevelopmental Disorders: Digital Innovation for Diagnosis, Care, and Clinical Management".

A novel approach to depression detection using POV glasses and machine learning for multimodal analysis

Hakan Kayış1*, Murat Çelik2, Vildan Çakır Kardeş3, Hatice Aysima Karabulut3, Ezgi Özkan3, Çınar Gedizlioğlu4, Burcu Özbaran5, Nuray Atasoy3
  • 1Department of Child and Adolescent Psychiatry, Faculty of Medicine, Zonguldak Bülent Ecevit University, Zonguldak, Türkiye
  • 2Independent Researcher, Ankara, Türkiye
  • 3Department of Psychiatry, Faculty of Medicine, Zonguldak Bülent Ecevit University, Zonguldak, Türkiye
  • 4Department of Computer Engineering, İzmir University of Economics, Izmir, Türkiye
  • 5Department of Child and Adolescent Psychiatry, Faculty of Medicine, Ege University, Izmir, Türkiye

Background: Major depressive disorder (MDD) remains challenging to diagnose due to its reliance on subjective interviews and self-reports. Objective, technology-driven methods are increasingly needed to support clinical decision-making. Wearable point-of-view (POV) glasses, which capture both visual and auditory streams, may offer a novel solution for multimodal behavioral analysis.

Objective: This study investigated whether features extracted from POV glasses, analyzed with machine learning, can differentiate individuals with MDD from healthy controls.

Methods: We studied 44 MDD patients and 41 age- and sex-matched healthy controls (HCs), aged 18–55 years. During semi-structured interviews, POV glasses recorded video and audio data. Visual features included gaze distribution, smiling duration, eye-blink frequency, and head movements. Speech features included response latency, silence ratio, and word count. Recursive feature elimination was applied. Multiple classifiers were evaluated, and the primary model, ExtraTrees, was assessed using leave-one-out cross-validation.

Results: After Bonferroni correction, smiling duration, center gaze duration, and happy facial expression duration showed significant group differences. The multimodal classifier achieved an accuracy of 84.7%, a sensitivity of 90.9%, a specificity of 78.0%, and an F1 score of 86%.

Conclusions: POV glasses combined with machine learning successfully captured multimodal behavioral markers distinguishing MDD from controls. This low-burden, wearable approach demonstrates promise as an objective adjunct to psychiatric assessment. Future studies should evaluate its generalizability in larger, more diverse populations and real-world clinical settings.

Introduction

Major depressive disorder (MDD) is one of the leading causes of disability worldwide, affecting more than 280 million people across different age groups and cultural backgrounds (1). Beyond its profound impact on quality of life, MDD contributes substantially to the global burden of disease, ranking among the top contributors to years lived with disability (2). Early and accurate detection of MDD is therefore critical, as timely interventions can significantly improve treatment outcomes and reduce long-term socioeconomic costs (3).

Despite this urgency, current diagnostic approaches rely primarily on clinical interviews and self-report questionnaires such as the Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition (DSM-5) criteria and the Beck Depression Inventory (BDI). While widely adopted, these tools are inherently subjective, susceptible to recall bias, and may vary in reliability across clinical contexts (4, 5). Therefore, there is a growing demand for objective, reproducible, and scalable biomarkers that can complement traditional psychiatric assessments.

In recent years, advances in artificial intelligence (AI) and machine learning (ML) have enabled automated extraction of behavioral and affective features from visual and auditory data. Visual modalities, including facial-expression analysis using the Facial Action Coding System (FACS), gaze tracking, and head-movement dynamics (6–8), have shown promise in distinguishing individuals with depression from healthy controls; depression is often characterized by slower head movements, reduced smiling, and restricted affect (8, 9). Auditory and linguistic modalities have also been explored, with features such as reduced speech intensity, monotony, increased jitter, and less phonetic variability associated with depressive symptoms (10–12). Speech-based models using ML classifiers have demonstrated moderate to high accuracy in detecting MDD, supporting the potential of audio features as digital biomarkers (13).

While both visual and auditory markers have shown promise individually, recent studies emphasize the advantage of multimodal approaches that integrate multiple channels of information (14–16). By combining complementary data sources, multimodal models tend to achieve superior accuracy and robustness compared to unimodal systems. However, most existing research relies on stationary cameras and webcams in laboratory environments, which may constrain ecological validity and fail to capture naturalistic interactions. To address these limitations and realize the benefits of multimodality in real clinical settings, clinician-worn point-of-view (POV) glasses can be used to unobtrusively acquire multimodal signals during psychiatric interviews without altering their natural flow. Because the camera is mounted on the clinician rather than the patient, no patient-worn hardware is required, thereby avoiding comfort and adherence burdens and reducing reactivity (Hawthorne effects) associated with conspicuous equipment. The egocentric, face-aligned vantage point preserves authentic clinical context (typical conversational distance, spontaneous posture, and genuine turn-taking), so that gaze behavior, affect, and speech timing are recorded as they naturally occur. In practice, this configuration yields facial footage with fewer off-axis artifacts than room-mounted webcams and minimizes self-presentation bias, thereby enhancing ecological validity while keeping the encounter clinically routine.

A critical gap in the literature concerns the use of wearable devices to capture multimodal behavioral data in more natural settings. POV glasses, equipped with front-facing cameras and microphones, provide a unique vantage point by recording interactions directly from the wearer's first-person perspective. This technology has the potential to overcome limitations of fixed-camera setups, offering a more ecological and unobtrusive method to assess behavioral patterns relevant to MDD. To date, only a limited number of studies have examined the utility of POV glasses in psychiatric research, and none have systematically evaluated their role in multimodal ML frameworks for depression detection.

The present study addresses this gap by investigating whether multimodal features extracted from POV glasses can be used to differentiate individuals with MDD from healthy controls. Specifically, visual features (e.g., gaze distribution, smiling duration, eye-blink frequency, and head movements) and speech features (e.g., response latency, silence ratio, and word count) were analyzed within a machine learning framework.

We hypothesized that (1) multimodal behavioral markers (gaze, affective expressions, eye-blink patterns, head movements, and speech-derived features) would differ between groups, and (2) a classifier trained on these features would accurately distinguish MDD from controls. By introducing a wearable and low-burden method for capturing behavioral data, this study seeks to advance the development of objective and scalable tools to support clinical diagnosis of depression.

Methods

Study design

This study employed an observational, cross-sectional design to evaluate behavioral and affective markers of major depressive disorder (MDD) using multimodal data captured with wearable POV glasses. During semi-structured clinical interviews, the interviewer (a trained clinician) wore POV glasses equipped with a front-facing camera and an integrated microphone. This setup enabled the unobtrusive recording of participants’ visual and auditory behaviors from the clinician’s natural perspective, such as facial expressions, gaze patterns, eye-blink dynamics, and speech characteristics. All interviews were conducted in a standardized indoor environment to ensure consistency across participants, with uniform lighting and recording conditions.

Recruitment

Participants were recruited from the Department of Psychiatry at Zonguldak Bülent Ecevit University Faculty of Medicine. The study consisted of two groups: patients diagnosed with major depressive disorder (MDD) and healthy controls.

Patients in the MDD group were consecutively enrolled among individuals presenting to the psychiatry outpatient clinic. The diagnostic process followed a structured two-step procedure: first, a psychiatric resident conducted the Structured Clinical Interview for DSM-5 Disorders (SCID-5), and subsequently, the diagnosis was confirmed by a board-certified psychiatrist with over 20 years of clinical experience. Healthy controls were recruited on a voluntary basis from the community and hospital staff and reported no current or past psychiatric disorders.

Inclusion criteria for both groups were: (1) age between 18 and 55 years and (2) willingness to participate in the study. Additionally, the MDD group required (3) a DSM-5 diagnosis of major depressive disorder confirmed through SCID-5 and (4) a Beck Depression Inventory (BDI) score ≥20. Healthy controls were required to score below the clinical threshold on the BDI. Within the MDD group, BDI severity distribution was as follows: moderate (20–28), n=28; severe (29–63), n=16, classified according to BDI thresholds for the Turkish adult population (17).

Exclusion criteria were: (1) history of neurological disorders, (2) strabismus or severe visual/hearing impairments that could interfere with audiovisual analysis, (3) cosmetic procedures such as botulinum toxin (Botox) injections within the past 6 months, (4) current or recent use of psychiatric medications, (5) comorbid bipolar disorder in the patient group, and (6) substance use disorder within the previous 12 months.

A total of 97 individuals were initially recruited for the study, including 50 patients with MDD and 47 healthy controls; data from several participants were subsequently excluded due to recording-quality issues. Short video dropouts (≤2 consecutive frames at 30 FPS) were corrected by linear interpolation, whereas segments with ≥10% missing frames or severe artifacts (e.g., heavy motion blur or occlusion) were excluded; in addition, interaction data could not be obtained for 1 patient and 2 controls because of technical failures during recording. At the participant level, recordings with ≥20% missing data in any domain were excluded (18) from the corresponding group (inferential) analyses: video data from 3 MDD patients and 4 controls were excluded for poor recording quality or excessive movement, and audio data from 2 MDD patients could not be analyzed because of excessive background noise, yielding a complete-case dataset for hypothesis testing. After these exclusions, the final sample comprised 85 participants: 44 patients with MDD and 41 healthy controls. The two groups were matched for age and sex, with no significant demographic differences between them. For the machine-learning pipeline, residual missing entries (<20% per feature × participant) were imputed within each training fold only (median for continuous, mode for categorical variables), and the fitted imputer was applied to the held-out participant to avoid information leakage. Subsequent statistical tests and ML modeling were performed on the resulting datasets as specified above. A flow diagram of the inclusion and exclusion criteria is shown in Figure 1.


Figure 1. Participant flow diagram for inclusion and exclusion across eligibility, enrollment, and analysis.

Data collection

All participants underwent a semi-structured clinical interview while seated face-to-face with the clinician. During each interview, the clinician wore POV glasses equipped with a front-facing high-definition camera (1080p resolution at 30 frames per second) and an integrated microphone (19). This setup enabled the unobtrusive audiovisual recording of participants’ verbal and nonverbal behaviors from the natural perspective of the interviewer.

The interview protocol consisted of four open-ended questions designed to elicit spontaneous speech, emotional expression, and natural interaction. The questions were as follows:

1. How have you been feeling recently? Could you describe this in detail?

2. Can you describe a typical day, from the moment you wake up in the morning until you go to bed at night?

3. Can you describe a positive moment that made you feel good in detail?

4. Can you describe a negative moment that made you feel bad in detail?

Each response lasted a minimum of 30 seconds, yielding sufficient audiovisual material for subsequent computational analysis. Interviews were conducted in a standardized clinical room with controlled environmental conditions. The distance between participant and clinician was maintained at approximately 100 cm, and ambient illumination was kept within 400–600 lux to minimize variability in facial feature detection. Background noise was reduced to ensure high-quality audio capture.

Following data collection, video recordings were prepared for analysis by extracting 30-second segments corresponding to each interview question. This step was performed using Movavi Video Editor (Version 22.0, Movavi, 2023), ensuring that only the relevant portions of the recordings were retained for further processing (20). Importantly, the researcher responsible for analyzing the video recordings was blinded to the diagnostic status of the participants to minimize potential bias. All recordings were securely stored in encrypted format and were accessible only to authorized members of the research team.

Feature extraction

A comprehensive set of audiovisual features was extracted from the recordings. The measured parameters included: duration of gaze directed at the interviewer (center), time spent looking to the right and left, smiling duration, simultaneous occurrence of forward gaze with smiling (social smiling), duration of neutral and happy facial expressions, number and duration of blinks, eye openness, total head movement, rapid head movements, response latency, silence ratio, and word count. For each participant, these parameters were measured across four 30-second video segments corresponding to the interview questions. To obtain a single representative score per feature, values from the four segments were averaged. This procedure minimized variability across questions and provided a stable estimate of each participant’s typical behavioral pattern.

Facial landmarks, head movements, eye openness, and blink-related parameters were extracted using MediaPipe, an open-source library that provides high-accuracy and accessible methods for detecting facial and body landmarks in video data (21–23). In particular, its FaceMesh module enables the estimation of 3D head pose (pitch, yaw, roll) from 2D video (Figure 2). Based on these angles, rapid head movements were defined as frame-to-frame changes greater than 5° on any axis, while total head movement was calculated as the cumulative sum of absolute angular changes across all three axes.


Figure 2. Example frame demonstrating MediaPipe FaceMesh feature detection used in this study. Green facial landmarks represent automatically identified key points across the participant’s face, which were utilized to calculate head pose, eye aspect ratio, and blink metrics.
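As a concrete illustration of the head-pose metrics described above, the following is a minimal Python sketch. It assumes per-frame pitch, yaw, and roll angles (in degrees) have already been estimated from the FaceMesh landmarks; the function name and array layout are illustrative rather than the authors' exact implementation, although the 5° rapid-movement threshold and the cumulative-sum definition follow the text.

```python
import numpy as np

def head_movement_metrics(angles_deg, rapid_threshold_deg=5.0):
    """Total and rapid head-movement metrics from per-frame pose angles.

    angles_deg: array of shape (n_frames, 3) with pitch, yaw, roll in degrees,
    as estimated (for example) from MediaPipe FaceMesh landmarks.
    """
    angles = np.asarray(angles_deg, dtype=float)
    # Frame-to-frame absolute angular change on each axis.
    deltas = np.abs(np.diff(angles, axis=0))                 # shape (n_frames - 1, 3)
    # Total head movement: cumulative sum of absolute changes across all three axes.
    total_movement = float(deltas.sum())
    # Rapid head movements: frames where any axis changes by more than the threshold.
    rapid_movements = int((deltas > rapid_threshold_deg).any(axis=1).sum())
    return total_movement, rapid_movements
```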

Eye openness and blink metrics were quantified using the Eye Aspect Ratio (EAR) (24). EAR is a geometric index that determines whether the eye is open or closed by using six landmarks around the eye (p1–p6), measured with MediaPipe (Figure 3). It is defined as:


Figure 3. Illustration of the Eye Aspect Ratio (EAR) measurement used for blink detection. Six key landmarks (P1–P6) define the geometric relationships between the upper and lower eyelids. EAR is computed as the ratio of the vertical to horizontal distances between these points, allowing automated identification of eye-opening and eye-closing states across video frames.

EAR = (d(p2,p6) + d(p3,p5)) / (2 × d(p1,p4))

where d(·) represents the Euclidean distance between two points. A blink was identified when EAR <0.2, a threshold commonly reported in the literature. For analysis, an EAR value greater than 0.20 was taken to indicate that the eyes were open. Eye openness was therefore calculated as the mean EAR value across all frames exceeding this threshold, representing the average degree of eye opening. Blink count was defined as the number of discrete sequences in which the EAR dropped below 0.20 and subsequently rose above it again, each such cycle being counted as one blink. Blink duration was defined as the average length of these sequences, calculated by dividing the total number of frames with EAR < 0.20 by the total number of blinks in the recording.
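The EAR-based blink logic can be sketched as follows in Python. Landmark extraction is assumed to have been done already (e.g., with MediaPipe FaceMesh); the function names are hypothetical, and blink duration is returned in frames per blink, which can be divided by the frame rate to obtain seconds.

```python
import numpy as np

def eye_aspect_ratio(p1, p2, p3, p4, p5, p6):
    """EAR = (|p2-p6| + |p3-p5|) / (2 * |p1-p4|) for six (x, y) eye landmarks."""
    d = lambda a, b: np.linalg.norm(np.asarray(a) - np.asarray(b))
    return (d(p2, p6) + d(p3, p5)) / (2.0 * d(p1, p4))

def blink_metrics(ear_per_frame, threshold=0.20):
    """Blink count, mean blink duration (frames per blink), and mean eye openness."""
    ear = np.asarray(ear_per_frame, dtype=float)
    closed = ear < threshold
    # A blink is one contiguous run of closed frames: EAR drops below the
    # threshold and subsequently rises above it again.
    onsets = np.flatnonzero(np.diff(closed.astype(int)) == 1)
    blink_count = int(closed[0]) + len(onsets)
    blink_duration = closed.sum() / blink_count if blink_count else 0.0  # divide by FPS for seconds
    eye_openness = float(ear[~closed].mean()) if (~closed).any() else float("nan")
    return blink_count, blink_duration, eye_openness
```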

Eye gaze direction was estimated using L2CS-Net, a deep convolutional neural network developed for fine-grained gaze estimation in unconstrained environments. L2CS-Net employs a dual-branch architecture in which yaw and pitch angles are predicted independently through a combination of classification and regression. Discretized angle bins are used for classification via a softmax cross-entropy loss, while continuous estimates are refined through regression using mean squared error. This hybrid loss design allows the model to achieve both categorical robustness and fine-grained precision (25). In our study, gaze direction was categorized into left, right, and center classes (Figure 4), with aggregated measures calculated for each interview segment. Raw yaw and pitch values are output in radians (−π to +π). In our recordings, due to the camera/model axis convention, the horizontal component of gaze aligned with the model's pitch (pitch > 0 → right, pitch < 0 → left), whereas yaw reflected vertical orientation and was not used for class assignment. Accordingly, frames were classified as right if pitch ≥ +0.17 rad (≈10°), left if pitch ≤ −0.17 rad, and center if |pitch| < 0.17 rad. To avoid confounds from extreme vertical gaze, we further restricted analyses to frames with |yaw| ≤ 0.35 rad (~20°); frames outside this range were marked invalid and excluded from aggregation.
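A minimal sketch of the frame-level gaze labeling described above, assuming per-frame pitch and yaw angles (in radians) have already been obtained from L2CS-Net; the axis convention, thresholds, and validity filter follow the text, while the function names and aggregation are illustrative.

```python
def classify_gaze(pitch_rad, yaw_rad,
                  horiz_thresh=0.17,   # ≈10°, horizontal component (model "pitch")
                  vert_limit=0.35):    # ≈20°, vertical component (model "yaw")
    """Map per-frame L2CS-Net angle outputs to left/right/center labels.

    Frames with extreme vertical gaze are marked invalid and excluded
    from aggregation, following the axis convention described in the text.
    """
    labels = []
    for pitch, yaw in zip(pitch_rad, yaw_rad):
        if abs(yaw) > vert_limit:
            labels.append("invalid")
        elif pitch >= horiz_thresh:
            labels.append("right")
        elif pitch <= -horiz_thresh:
            labels.append("left")
        else:
            labels.append("center")
    return labels

def center_gaze_ratio(labels):
    """Proportion of valid frames classified as center gaze within a segment."""
    valid = [label for label in labels if label != "invalid"]
    return valid.count("center") / len(valid) if valid else float("nan")
```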


Figure 4. Examples of gaze direction estimation using L2CS-Net. The model predicts yaw and pitch angles from facial landmarks to classify gaze orientation into right, center, and left directions. The arrows illustrate the estimated gaze vectors, and angle values (pitch, yaw) are displayed for each frame.

Emotional expressions were analyzed using the Facial Action Coding System (FACS) (Ekman & Friesen, 1978), which decomposes facial expressions into individual muscle movements referred to as Action Units (AUs) (26). For example, AU12 corresponds to the contraction of the zygomaticus major muscle, which produces a smile, while combinations of AUs can represent complex emotions such as happiness or sadness. Happiness was identified by the co-activation of AU6 (orbicularis oculi, producing eye constriction) and AU12, a combination that is commonly used in the literature to index genuine positive affect (Duchenne smiles). Neutral emotion was defined as the absence of major expressive AUs, reflecting a baseline or affectively flat facial configuration. In this study, the Py-Feat software was used to automatically code AUs from video frames (27). Smiling was quantified by the presence of AU12, and social smiling was operationalized as the simultaneous occurrence of AU12 activation and center gaze. This operationalization of social gaze was adapted from prior POV-based studies in children with autism (28).
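A hedged sketch of the AU-based affect coding using Py-Feat is shown below. The detector defaults, the AU column names, the clip file name, and the 0.5 activation cutoff are assumptions that may need adjusting for a particular Py-Feat version; they are not the authors' exact settings.

```python
from feat import Detector  # Py-Feat

# AU column names ("AU06", "AU12") and the 0.5 cutoff are assumptions for illustration.
detector = Detector()
fex = detector.detect_video("segment_q1.mp4")   # hypothetical 30-second answer clip

au06_active = fex["AU06"] > 0.5                 # cheek raiser (orbicularis oculi)
au12_active = fex["AU12"] > 0.5                 # lip-corner puller (zygomaticus major)

fps = 30
smiling_duration = au12_active.sum() / fps                  # seconds of AU12 activity
happy_duration = (au06_active & au12_active).sum() / fps    # Duchenne-style happiness (AU6 + AU12)
# Social smiling would additionally require the gaze label for the same frame to be "center".
```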

Finally, speech features were extracted using Whisper, an open-source deep learning model for automatic speech recognition trained on a large-scale multilingual and multitask dataset. Whisper has demonstrated high performance in Turkish automatic speech recognition tasks, making it a suitable tool for analyzing speech features in this study (29). Transcripts were generated with word-level timestamps and aligned to the interview structure. Question–answer boundaries were defined using the end of the interviewer’s question (question-end time) as the reference point. Response latency was computed as the time (ms) from this question-end to the onset of the participant’s first frame in the subsequent answer segment. Speech/silence segmentation relied on voice-activity detection (VAD) with 30-ms frames and 10-ms hops; non-speech frames were identified by the VAD and very short gaps (<150 ms) were merged to avoid spurious pauses. The silence ratio was defined as the proportion of non-speech frames within each answer segment (leading/trailing silences outside the answer boundaries were excluded). Word count was computed on participant speech only. These features (response latency, silence ratio, word count) were then used in subsequent statistical and machine-learning analyses. All features were derived automatically. Manual spot-checks on a random subset confirmed alignment between visual overlays and extracted values across all domains.
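The speech-feature pipeline can be approximated with openly available tools, as in the sketch below. It uses openai-whisper for word-level timestamps and webrtcvad for speech/non-speech labeling; the file names are hypothetical, and the 10-ms hop and <150-ms gap-merging steps described above are omitted for brevity, so this is an illustrative simplification rather than the exact pipeline.

```python
import whisper           # openai-whisper
import webrtcvad

# Transcribe the participant's answer with word-level timestamps (Turkish).
model = whisper.load_model("medium")
result = model.transcribe("answer_q1.wav", language="tr", word_timestamps=True)

words = [w for seg in result["segments"] for w in seg.get("words", [])]
word_count = len(words)

# Response latency: first word onset relative to the interviewer's question end
# (here the answer clip is assumed to start exactly at question end).
question_end_s = 0.0
latency_ms = (words[0]["start"] - question_end_s) * 1000 if words else None

def silence_ratio(pcm_bytes, sample_rate=16000, frame_ms=30, aggressiveness=2):
    """Proportion of non-speech frames in 16-bit mono PCM audio (non-overlapping frames)."""
    vad = webrtcvad.Vad(aggressiveness)
    frame_len = int(sample_rate * frame_ms / 1000) * 2       # bytes per 30-ms frame
    frames = [pcm_bytes[i:i + frame_len]
              for i in range(0, len(pcm_bytes) - frame_len + 1, frame_len)]
    speech = [vad.is_speech(f, sample_rate) for f in frames]
    return 1.0 - (sum(speech) / len(speech)) if speech else float("nan")
```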

Statistical analysis

All statistical analyses were conducted using IBM SPSS Statistics (Version 27, IBM Corp, Armonk, NY) and Python (Version 3.9) with the SciPy library. The normality of continuous variables was assessed using the Shapiro–Wilk test. For normally distributed data, independent-samples t tests were used to compare group means between the MDD and control groups. For non-normally distributed data, Mann–Whitney U tests were applied. Categorical variables were compared using chi-square tests.

Because multiple, conceptually related outcomes were tested, we controlled the family-wise error rate within behavioral domains rather than across all individual variables. Specifically, the 15 features were organized a priori into five domains based on theoretical and measurement considerations: (i) gaze-related measures (center/”eye contact”, right, left; m(gaze)=3), (ii) facial affect measures (smiling duration [AU12], happy faces, neutral faces, social smiling [AU12 with center gaze]; m(affect)=4), (iii) ocular physiology (blink count, blink duration, eye openness; m(ocular)=3), (iv) head movement (total head movements, rapid head movements; m(head)=2), and (v) speech-derived features (response latency, silence ratio, word count; m(speech)=3). Within each domain, a Bonferroni correction was applied as α_domain = .05/m_domain, yielding the following adjusted thresholds used for primary inference:

➢ Gaze: α = .05/3 ≈.0167

➢ Facial affect: α = .05/4 = .0125

➢ Ocular physiology: α = .05/3 ≈.0167

➢ Head movement: α = .05/2 = .0250

➢ Speech: α = .05/3 ≈.0167

This domain-wise control limits Type I error where outcomes are correlated within the same construct, while avoiding the over-conservatism of correcting across all 15 endpoints simultaneously. In the Results, we report exact P values; findings surpassing the domain-specific α thresholds are denoted as significant after correction, and findings with P <.05 but ≥ α_domain are described as nominal (uncorrected) and interpreted cautiously.
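The domain-wise thresholds can be reproduced with a few lines of Python; the sketch below simply encodes the feature counts per domain given above.

```python
# Domain-wise Bonferroni thresholds: alpha_domain = .05 / m_domain.
domains = {
    "gaze": 3,
    "facial_affect": 4,
    "ocular_physiology": 3,
    "head_movement": 2,
    "speech": 3,
}
alpha = 0.05
thresholds = {name: alpha / m for name, m in domains.items()}

def is_significant(p_value, domain):
    """True if p survives the domain-specific Bonferroni threshold."""
    return p_value < thresholds[domain]

# Example: smiling duration (facial affect), P = .005 -> significant (.005 < .0125).
print(is_significant(0.005, "facial_affect"))   # True
```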

With a total sample of n=85 (MDD = 44, HC = 41), two-sided tests at α=.05 provide 80% power to detect a standardized mean difference of approximately Cohen's d ≈ 0.61 (r ≈ 0.29, a medium effect). Under the domain-wise thresholds used in this study, the minimum detectable effects at 80% power are d ≈ 0.67 for α=.025 (head movement), d ≈ 0.70 for α=.0167 (gaze/ocular/speech), and d ≈ 0.72 for α=.0125 (facial affect), corresponding to r ≈ 0.32–0.34. Thus, the study is well-powered to detect medium-to-large effects, while smaller effects (d < ~0.6) may be underpowered.
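As an illustration, the minimum detectable effect sizes reported above can be reproduced approximately with statsmodels; exact values depend on rounding of the alpha thresholds.

```python
from statsmodels.stats.power import TTestIndPower

# Minimum detectable Cohen's d at 80% power for a two-sided independent-samples
# t-test with n1 = 44 (MDD) and n2 = 41 (HC), at the thresholds used in the study.
analysis = TTestIndPower()
for alpha in (0.05, 0.025, 0.0167, 0.0125):
    d = analysis.solve_power(effect_size=None, nobs1=44, ratio=41 / 44,
                             alpha=alpha, power=0.80, alternative="two-sided")
    print(f"alpha = {alpha:<7} minimum detectable d ≈ {d:.2f}")
```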

Machine learning analysis

In addition to group-level comparisons, machine learning methods were employed to evaluate the classification performance of behavioral features in distinguishing between patients with MDD and healthy controls. We implemented a supervised learning pipeline and evaluated several algorithms (e.g., ExtraTrees, Random Forests, Gradient Boosting, k-nearest neighbors, and support vector machines).

To enhance generalizability while preventing information leakage, recursive feature elimination (RFE) and all preprocessing steps were performed within each training fold of the cross-validation procedure. RFE systematically removed features with the lowest importance at each iteration until an optimal subset of predictors was identified.

To ensure robust evaluation with limited data, model performance was assessed using nested leave-one-out cross-validation (LOOCV). In the inner loop, RFE and hyperparameter tuning were conducted using only the training data, while in the outer loop the held-out participant served exclusively for testing. To improve calibration and interpretability, probabilistic outputs were adjusted in a fold-specific manner using either Platt scaling or isotonic regression on the training fold only. Performance metrics included accuracy, precision, recall (sensitivity), specificity, F1-score, and the area under the receiver operating characteristic curve (ROC-AUC). Feature importance scores were also examined to identify the most discriminative behavioral markers.
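A simplified sketch of the nested LOOCV pipeline is given below, using scikit-learn. The hyperparameter grid and fold counts are illustrative, and the probability-calibration step (Platt scaling or isotonic regression) is omitted for brevity; X and y are assumed to be NumPy arrays of features and binary labels.

```python
import numpy as np
from sklearn.pipeline import Pipeline
from sklearn.impute import SimpleImputer
from sklearn.feature_selection import RFE
from sklearn.ensemble import ExtraTreesClassifier
from sklearn.model_selection import LeaveOneOut, GridSearchCV

def nested_loocv_predictions(X, y):
    """Outer LOOCV with fold-specific imputation, RFE, and inner-loop tuning."""
    outer = LeaveOneOut()
    y_pred = np.empty_like(y)
    y_prob = np.empty(len(y), dtype=float)
    for train_idx, test_idx in outer.split(X):
        pipe = Pipeline([
            ("impute", SimpleImputer(strategy="median")),   # fitted on the training fold only
            ("rfe", RFE(ExtraTreesClassifier(n_estimators=200, random_state=0))),
            ("clf", ExtraTreesClassifier(n_estimators=200, random_state=0)),
        ])
        grid = GridSearchCV(                                # inner loop: tune on training data only
            pipe,
            param_grid={"rfe__n_features_to_select": [5, 8, 10]},
            cv=5, scoring="roc_auc",
        )
        grid.fit(X[train_idx], y[train_idx])
        y_pred[test_idx] = grid.predict(X[test_idx])        # held-out participant used only for testing
        y_prob[test_idx] = grid.predict_proba(X[test_idx])[:, 1]
    return y_pred, y_prob
```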

Ethical considerations

The study protocol was reviewed and approved by the Non-Interventional Clinical Research Ethics Committee of Zonguldak Bülent Ecevit University (approval number: 2024/21). Written informed consent was obtained from all participants prior to enrollment. All data were anonymized, and confidentiality was strictly maintained throughout the study.

Results

Statistical analysis

The demographic characteristics of the groups were comparable. The MDD group consisted of 24 women and 20 men, while the control group included 21 women and 20 men, with no significant difference in sex distribution (χ²(1)=0.05, P = .829). The mean age of patients with MDD was 37.81 years, compared with 37.17 years in the control group, again showing no significant difference (t(83)=0.238, P = .813).

Prior to group comparisons, the distribution of variables was examined using both Kolmogorov–Smirnov and Shapiro–Wilk tests. Only center gaze and word count met the criteria for normal distribution across both groups, and these were analyzed using independent-samples t-tests. All other behavioral features deviated significantly from normality and were therefore examined using Mann–Whitney U tests. This analytical approach ensured that the most appropriate statistical methods were applied based on the distributional characteristics of each variable.

Happy facial expressions were markedly reduced in patients with MDD (mean rank=34.53) relative to controls (mean rank=52.09), Mann–Whitney U = 529.5, Z=–3.29, P = .001. This difference survived Bonferroni correction (α=.0125) and thus represents a robust group-level effect.

Clear between-group differences were found in multiple facial affect parameters, with some effects surviving Bonferroni correction and others reaching only nominal significance. Smiling duration was significantly shorter in the MDD group (mean rank=35.75) compared with controls (mean rank=50.78), Mann–Whitney U = 583.0, Z=–2.82, P = .005. This effect reached significance at the corrected threshold (corrected α=.0125).

Social smiling, defined as the simultaneous occurrence of smiling and center gaze, was also lower in the MDD group (mean rank=37.09) compared with controls (mean rank=49.34), Mann–Whitney U = 642.0, Z=–2.33, P = .020. This result achieved nominal significance but did not survive the corrected threshold.

Clear group differences emerged in gaze directed toward the interviewer, whereas lateral gaze parameters showed weaker or nonsignificant effects. Patients with MDD spent significantly less time fixating on the interviewer’s face compared with healthy controls (M = 13.51, SD = 5.75 vs M = 19.22, SD = 3.78). This difference was highly robust, Welch’s t(74.87) = –5.44, P <.001, 95% CI [–7.80, –3.62], Cohen’s d ≈ –1.16, and survived Bonferroni correction for the gaze domain (corrected α = .0167).

For rightward gaze, patients with MDD displayed longer gaze durations (mean rank=48.89) relative to controls (mean rank=36.68), Mann–Whitney U = 643.0, Z=–2.28, P = .023. Although this difference reached nominal significance at the uncorrected threshold (P <.05), it did not survive the domain-specific Bonferroni adjustment. Leftward gaze duration did not differ between groups (mean rank=46.83 vs 38.89; U = 733.5, P = .138).

Finally, neutral facial expressions were more prevalent in the MDD group (mean rank=49.27) than in controls (mean rank=37.09), Mann–Whitney U = 626.0, Z=–2.42, P = .015. This effect was nominally significant but did not remain after Bonferroni correction.

Group comparisons of ocular features revealed differences in blink-related parameters, whereas eye openness did not vary between groups. For blink count, patients with MDD (mean rank=46.42) did not differ significantly from controls (mean rank=39.33), Mann–Whitney U = 751.5, Z=–1.32, P = .185. Blink duration was longer in the MDD group (mean rank=48.73) compared with controls (mean rank=36.85), Mann–Whitney U = 650.0, Z=–2.22, P = .026. This effect reached nominal significance (P <.05) but did not survive Bonferroni correction for the ocular domain (corrected α=.0167). Eye openness did not differ significantly between groups (mean rank=44.57 vs 41.32), Mann–Whitney U = 833.0, Z=–0.61, P = .543.

No significant between-group differences were found for head movement parameters. For total head movements, the MDD group (mean rank=45.09) and controls (mean rank=40.76) did not differ significantly, Mann–Whitney U = 810.0, Z=–0.81, P = .418. Similarly, rapid head movements were comparable across groups (mean rank=42.49 vs 43.55), Mann–Whitney U = 879.5, Z=–0.20, P = .843. Both effects were far from significance and did not approach the corrected α threshold for the head movement domain (α=.025).

No significant group differences were observed for speech-derived features. For word count, patients with MDD (M = 43.02, SD = 10.40) and controls (M = 45.89, SD = 9.29) did not differ significantly, t(83)=–1.34, P = .183, 95% CI [–7.13, 1.38]. Response latency (time to first word) was comparable between groups (mean rank=44.50 vs 41.39), Mann–Whitney U = 836.0, Z=–0.58, P = .561. Similarly, silence ratio did not differ significantly between patients (mean rank=43.25) and controls (mean rank=42.73), Mann–Whitney U = 891.0, Z=–0.10, P = .923. None of these measures approached significance under the uncorrected threshold, and therefore no effects survived Bonferroni adjustment for the speech domain (α=.0167). Statistically significant and nominally significant features are provided in Table 1. The comprehensive version of this table (which includes non-significant features) can be found in Table 2.


Table 1. Summary of statistically significant and nominally significant features across domains.


Table 2. Group comparisons of behavioral features between MDD and control groups.

Machine learning classification results

To complement the group-level comparisons, machine learning analyses were conducted to evaluate the predictive value of multimodal behavioral features for distinguishing individuals with MDD from healthy controls. Several supervised learning methods were initially tested, including tree-based algorithms (random forests, gradient boosting), nearest neighbor classifiers, and support vector machines. Among these, the ExtraTrees algorithm demonstrated superior and more stable performance across evaluation metrics and was therefore selected as the primary model (30).

Prior to model training, feature importance values were calculated by obtaining importance scores from each supervised learning method described above. Gini impurity-based feature importances are computed automatically during fitting in most supervised learning libraries. Once calculated, the importances were averaged across models, yielding a robust estimate of each feature's contribution; results are shown in Figure 5. Recursive feature elimination (RFE) was applied to iteratively remove features with the lowest importance scores, yielding a reduced subset of predictors with higher generalizability. This procedure aimed to minimize overfitting, particularly given the relatively modest sample size.
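A minimal sketch of the model-averaged importance computation is given below, restricted to tree-based learners because Gini-based importances are only defined for them; hyperparameters and names are illustrative, not the authors' exact configuration.

```python
import numpy as np
import pandas as pd
from sklearn.ensemble import (ExtraTreesClassifier, RandomForestClassifier,
                              GradientBoostingClassifier)

def averaged_importances(X, y, feature_names):
    """Average Gini-based feature importances across tree-based models."""
    models = [
        ExtraTreesClassifier(n_estimators=200, random_state=0),
        RandomForestClassifier(n_estimators=200, random_state=0),
        GradientBoostingClassifier(random_state=0),
    ]
    importances = []
    for model in models:
        model.fit(X, y)                            # importances populated during fitting
        importances.append(model.feature_importances_)
    mean_importance = np.mean(importances, axis=0)
    return pd.Series(mean_importance, index=feature_names).sort_values(ascending=False)
```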

Figure 5. Model-averaged feature importances; higher values indicate greater contribution to classification. Center gaze showed the highest importance, followed by blink duration and neutral face duration.

Model validation was performed using the leave-one-out cross-validation (LOOCV) strategy, which maximizes data utilization by iteratively training on all participants except one and testing on the held-out individual. This approach provides a robust estimate of model performance in small-sample studies.

The ExtraTrees classifier demonstrated strong performance, achieving an accuracy of 84.7%, precision of 81.6%, sensitivity (recall) of 90.9%, specificity of 78.0%, an F1 score of 86%, and an AUC of 0.89, indicating a high overall ability to distinguish between the two classes.

Performance with confidence intervals. Using pooled outer-fold test predictions from LOOCV, proportion metrics were reported with Wilson 95% CIs: accuracy 84.7% (75.5–90.8), precision 81.6% (72.1–88.4), sensitivity 90.9% (82.9–95.4), and specificity 78.0% (68.1–85.5) (n=85). The F1-score was 86.0% (77.1–91.8), estimated via stratified bootstrap. The ROC–AUC was 0.89 (95% CI 0.82–0.96), with the interval computed using the Hanley & McNeil (1982) normal approximation based on the pooled outer-fold predictions (positives N1=44 MDD, negatives N2=41 HC) (31). These intervals quantify uncertainty using held-out (outer-fold) test outputs only.
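For reference, these interval estimates can be approximated as follows; the accuracy count (72 of 85 correct, i.e., 84.7%) is inferred from the reported metrics, and the Hanley and McNeil formula is implemented directly.

```python
import math
from statsmodels.stats.proportion import proportion_confint

# Wilson 95% CI for a proportion metric (e.g., accuracy: 72 of 85 correct).
lo, hi = proportion_confint(count=72, nobs=85, alpha=0.05, method="wilson")
print(f"accuracy 95% CI: ({lo:.3f}, {hi:.3f})")

def auc_ci_hanley_mcneil(auc, n_pos, n_neg, z=1.96):
    """Normal-approximation CI for the AUC (Hanley & McNeil, 1982)."""
    q1 = auc / (2 - auc)
    q2 = 2 * auc**2 / (1 + auc)
    se = math.sqrt((auc * (1 - auc)
                    + (n_pos - 1) * (q1 - auc**2)
                    + (n_neg - 1) * (q2 - auc**2)) / (n_pos * n_neg))
    return auc - z * se, auc + z * se

print(auc_ci_hanley_mcneil(0.89, n_pos=44, n_neg=41))   # ≈ (0.82, 0.96)
```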

Figure 6 presents the receiver operating characteristic (ROC) curves for the classification task addressed in this study. Each curve illustrates the relationship between the true positive rate and the false positive rate across varying classification thresholds, providing an overall view of a model's discriminative performance; the area under the curve (AUC) serves as a quantitative indicator, with higher values denoting stronger separation between the positive and negative classes. Multiple ROC curves are shown, each produced by a different supervised learning model. The ExtraTrees algorithm (curves labeled 'xtr' and 'xtr/2', green and brown, respectively) outperformed the remaining models in distinguishing between the two classes, confirming its reliability in the context of this problem. Support vector machines and nearest-neighbor methods were excluded from this specific analysis, as they had already been eliminated from the pool of candidate models. ExtraTrees introduces additional randomization by selecting split thresholds at random, which helped reduce overfitting and improved generalization, especially on noisy data; the increased randomness also led to greater model diversity, resulting in slightly better predictive performance than Random Forests and Gradient Boosting.


Figure 6. Receiver operating characteristic (ROC) curves for multiple supervised machine learning models used to classify major depressive disorder (MDD) and healthy controls. The ExtraTrees classifier achieved the highest performance (AUC = 0.89), followed by AdaBoost (AUC = 0.83) and Random Forest (AUC = 0.81). The diagonal dashed line represents chance-level performance.

Discussion

Principal findings

This study demonstrated that behavioral features extracted from point-of-view (POV) video recordings during clinical interviews can reliably distinguish patients with major depressive disorder (MDD) from healthy controls. To our knowledge, this is the first study to use clinician-worn POV video to capture multimodal behavioral signals during real clinical interviews with patients diagnosed with MDD. Although the two groups were comparable in age and sex, significant differences emerged in gaze and affective expression: patients with MDD exhibited shorter smiling duration, reduced center gaze, and fewer happy facial expressions than controls. These three parameters remained significant after Bonferroni correction for multiple comparisons, highlighting their robustness as potential behavioral markers of depression.

Additional group-level differences were observed at nominal significance levels, including reduced social smiling (simultaneous smiling and center gaze), and prolonged neutral facial expressions among patients with MDD. Right gaze and blink duration were also higher in the MDD group, though these findings did not survive correction for multiple testing. Together, these results suggest that both gaze-related and affective cues may reflect disruptions in social engagement and emotional expressivity characteristic of depression.

In contrast, head movement parameters (total and rapid head movements) and speech-derived features (response latency, silence ratio, and word count) did not differ significantly between groups.

Beyond group-level comparisons, the application of machine learning further supported the diagnostic value of these multimodal features. The ExtraTrees classifier—optimized using recursive feature elimination and validated through leave-one-out cross-validation—demonstrated strong overall performance (accuracy = 84.7%, AUC = 0.89, F1-score = 86%). These results suggest that multimodal behavioral data, collected through a low-burden point-of-view (POV) methodology, provide a meaningful contribution to the objective and scalable assessment of depression.

Comparison with prior work

Our findings align with prior research indicating that patients with MDD exhibit reduced gaze toward social partners and diminished positive affective expressions. Previous eye-tracking studies in laboratory settings have consistently reported shorter fixation durations and reduced eye contact in individuals with depression, which have been interpreted as behavioral markers of social withdrawal and negative self-referential bias (3234). Eye contact, broadly defined as the mutual exchange of gaze between two individuals, is a cornerstone of social communication. It facilitates conversation initiation, turn-taking, and the regulation of arousal during interpersonal interactions, while also allowing individuals to track the attentional and emotional states of their partners (3538). Developmental research further underscores its importance, showing that infants preferentially orient to direct gaze and that mutual gaze plays a key role in attachment and bonding (39, 40).

Against this background, the reduced center gaze duration observed in our study supports the notion that depression disrupts core mechanisms of social engagement. Interestingly, patients with MDD in our sample also showed a tendency toward prolonged rightward gaze. Alghowinem et al. similarly reported that depressed individuals spent more time directing their head and gaze to the right side during stationary camera recordings, which was interpreted as a behavioral manifestation of gaze aversion and reduced willingness to engage in social interaction (41). Although this lateralization effect did not remain significant after correction in our data, the pattern may nonetheless reflect a subtle form of social disengagement or avoidance of direct interpersonal contact.

Eye blinking is not merely a reflexive behavior but is also regulated by the central nervous system and influenced by neurotransmitters such as dopamine (42). Blink rate has been proposed as an indirect marker of dopaminergic activity and is known to vary with attentional demands, emotional state, and mental tension (43). However, findings regarding the association between blink rate and depression remain inconsistent across studies. Mackintosh et al. reported that patients with depression exhibited higher blink rates in video-based assessments, which subsequently decreased following treatment (44). Similarly, Lee et al. observed increased blink frequency particularly in late-life depression (45). Another study employing OpenCV and convolutional neural networks (CNNs) to detect blinking found a weak but positive correlation between blink rate and depression severity (R²≈0.035) (46). In our study, patients with MDD also showed numerically higher blink counts compared with controls, although this difference did not reach statistical significance.

Beyond blink frequency, additional ocular parameters such as blink duration and eye openness provide complementary information about oculomotor control and affective state. Prior studies have used webcam recordings during the presentation of emotionally valenced stimuli (e.g., happy or sad videos) and applied the Eye Aspect Ratio (EAR) to categorize eye states as open, partially closed, or closed (47). These works suggested that reduced eye openness and a “partially closed eye” appearance may be more frequently observed in individuals with depression, potentially serving as behavioral markers of affective blunting. Furthermore, it has been proposed that combining blink dynamics with other ocular signals, such as gaze direction or pupil diameter, could improve the reliability of depression-related biomarkers. Ramalho et al. reported that patients with depression showed decreased eye openness and prolonged blink duration when measured via EAR-based analysis from webcam recordings (47).

In line with these findings, our study revealed that eye openness did not differ between groups, but blink duration was significantly prolonged in the MDD group. One possible explanation for this pattern is that extended eyelid closure may reflect psychomotor retardation, a common clinical feature of depression, manifesting as slowed or prolonged motor acts. Alternatively, longer blinks could represent subtle disengagement from social interaction, consistent with broader withdrawal tendencies observed in depression. However, given the mixed results in the literature, further studies incorporating multimodal ocular measures will be necessary to clarify the mechanisms underlying these differences.

Reduced smiling and diminished happy facial expressions are well-documented features of depression, repeatedly confirmed across behavioral and observational studies (48, 49). In depressed individuals, the frequency, intensity, and duration of smiling and happy facial expressions are consistently lower compared with healthy controls (50, 51), with spontaneous (nonvolitional) smiles being particularly reduced (52, 53). By contrast, research on neutral facial expressions in depression remains relatively scarce and lacks a clear consensus. For instance, Lee et al. reported reduced neutral expressions in a simulation paradigm; however, participants in that study deliberately exaggerated sadness-related expressions, which may confound interpretation (54). Similarly, Wang et al. using stationary cameras and instructed emotion tasks, found a general reduction in facial variability among depressed individuals, suggesting less flexibility in affective expression (55).

In our study, both smiling and happy expressions were attenuated in the MDD group, while neutral expressions were increased. These findings are consistent with previous evidence identifying anhedonia and affective blunting as core behavioral manifestations of depression. These results suggest that reduced positive affect—captured through multimodal visual markers such as smiling and gaze engagement—may reflect diminished reward sensitivity and social motivation. In parallel, the clinical management of treatment-resistant depression (TRD) remains an active area of debate and development (56), and recent studies report pharmacological avenues targeting anhedonia and affective functioning (57) as well as related cross-diagnostic evidence (58). These trends underscore that, beyond treating TRD, objective markers are needed to identify, monitor, and compare treatment response across settings.

Our POV-based markers quantify state-level expressions of positive affect and social engagement that dovetail with trait-level affective temperaments described in contemporary spectrum models. Temperament research indicates that these genetically influenced dispositions underlie mood, substance-use, and risk-taking disorders; shape clinical presentation, course, and treatment response (especially in bipolar disorder and MDD); and relate to insight and role functioning across phases (59, 60). Within this framework, our results align with temperamental liabilities linked to anhedonia/affective blunting (e.g., depressive/cyclothymic poles), while also suggesting that hyperthymia—often discussed as protective—may correspond to greater smiling and sustained gaze engagement. Taken together, integrating objective, POV-derived behavioral biomarkers with pharmacological evidence and temperament assessments strengthens the translational potential of digital phenotyping.

The operationalization of “social smiling” as the temporal overlap of forward gaze and smiling provides an ecologically meaningful index of socially contingent affective responses. While this construct has been primarily studied in POV-based research on children with autism (28), it has not, to our knowledge, been systematically investigated in adult psychiatric populations, highlighting the novelty of the present study. In our sample, the duration of social smiling was numerically reduced in patients with MDD and reached nominal significance at the uncorrected level. However, this effect did not survive Bonferroni correction, indicating that the finding should be interpreted with caution. Nevertheless, the trend toward diminished social smiling is consistent with the broader literature on depression, which emphasizes reductions in both eye contact and positive affective displays.

Prior research has consistently associated depression with reduced or slowed head movement dynamics. Alghowinem et al. examined 30 patients with severe depression and 30 healthy controls using stationary camera recordings, extracting head pose changes at 10-frame intervals. Their results indicated markedly slower head movements, characterized by lower pitch and yaw velocities, and fewer directional changes overall, reflecting generalized motor slowing (41). Similarly, Kacem et al. employed the ZFace toolkit to identify 49 facial landmarks and reported decreased head motion amplitude in participants with more severe depressive symptoms (61).

In contrast to these findings, our study did not observe significant group differences in total or rapid head movements. Several methodological factors may account for this discrepancy. First, our sample comprised individuals meeting diagnostic criteria for MDD but not necessarily exhibiting severe symptomatology, and depression severity was not modeled continuously. Second, unlike prior studies that relied on stationary laboratory cameras, our recordings were collected from naturalistic clinical interviews using a POV setup worn by the interviewer. This configuration captures ecologically valid interpersonal exchanges but may introduce variability related to interactive dynamics. For instance, participants in the control group tended to maintain a more sustained center gaze toward the interviewer, potentially reducing the need for larger head movements. Conversely, subtle nods or posture adjustments during conversation might have been more frequent but less pronounced in depressed participants, making quantitative differences harder to detect.

Taken together, these findings suggest that head movement patterns may be sensitive to both methodological context and depression severity. Future research integrating continuous severity measures and multimodal motion tracking could help clarify whether diminished head movement represents a stable biomarker or a context-dependent behavioral correlate of depression.

Alterations in speech patterns have long been recognized as characteristic features of depressive states. Prior studies have demonstrated that individuals with depression tend to speak more slowly, with lower vocal intensity, reduced prosodic variation, and longer pauses compared with healthy controls. For example, Cummins et al. reported that pause duration was significantly prolonged among English-speaking participants with depression, although this effect was not consistently replicated across other languages, suggesting potential linguistic or cultural modulation of speech-related markers (62). Similarly, Yamamoto et al. analyzed unstructured interview recordings and found that patients with depression exhibited slower speech rate, longer response latencies, and extended pauses relative to nondepressed participants (63).

In our study, although the MDD group produced fewer words on average than healthy controls, this difference did not reach statistical significance. This finding may indicate that while verbal productivity shows a downward trend in depression, the magnitude of the effect can vary depending on contextual factors such as task structure, language, or interview dynamics. Given that our data were collected during naturalistic clinical interviews rather than constrained reading or speech tasks, subtle alterations in temporal features of speech may have been less pronounced.

With the rapid advancement of artificial intelligence, machine learning (ML) has become an increasingly valuable tool for identifying complex behavioral and biological markers of psychiatric disorders. Traditional diagnostic methods in psychiatry rely heavily on clinician-administered interviews and self-reported questionnaires, which can be subjective and prone to bias. In contrast, ML-based approaches enable the extraction of objective, high-dimensional features from multimodal data such as facial expressions, speech, physiological signals, and self-reports, offering new pathways for early detection and personalized treatment of depression. Recent work introduced Clinical 15, a machine learning model trained on multimodal data from primary care settings, which achieved a balanced accuracy of 88.2% and demonstrated the feasibility of integrating biological, physiological, and self-reported information for differential diagnosis (64). Similarly, a deep learning framework was proposed that combines facial video and audio modalities using spatiotemporal attention and graph convolutional networks to enhance multimodal feature fusion, achieving robust performance in automatic depression detection (65). Complementing these findings, it has been emphasized that incorporating behavioral and physiological signals into ML-based frameworks can outperform traditional rating scales in predicting depressive states, underscoring the importance of multimodal integration for improving clinical validity (66).

In this context, our study adds novel evidence by demonstrating that a wearable, first-person (POV) system can achieve a high classification accuracy (84.7%) and AUC of 0.89, comparable to or exceeding performance metrics reported in prior multimodal ML studies. Importantly, by using a low-burden and ecologically valid setup, our approach bridges the gap between controlled laboratory research and real-world clinical application, underscoring the potential of wearable ML-assisted systems for scalable, objective assessment of depression in everyday contexts.

Beyond its technical contribution, this study also introduces a novel methodological framework by integrating multiple behavioral domains (facial affect, gaze behavior, head movement, and speech) captured during authentic clinician–patient interactions. Unlike webcam-based or stationary laboratory recordings, our approach collects data from real clinical interviews without disrupting their natural flow, providing a more genuine representation of social and affective behavior in depression. This integration of ecological validity with multimodal computational analysis represents an important step toward objective, clinically applicable assessment methods for depressive disorders.

Finally, responsible deployment of POV-based capture in clinical settings requires explicit attention to ethics and data protection. Because POV capture can incidentally record bystanders and sensitive contexts, its use should follow privacy-by-design: clear consent and user control; purpose-limited collection; data minimization with on-device or ephemeral processing and retention of derived features rather than raw video when feasible; de-identification/pseudonymization, encrypted storage and role-based access; time-bound retention and deletion; and site-level governance and transparency. These safeguards, aligned with contemporary frameworks, are essential for any clinical deployment. If these privacy conditions can be met in practice, POV capture may be a feasible option for clinical research and, subject to context, clear opt-out pathways, and human oversight, could potentially be considered for clinician use at the point of care. These practices align with recent calls in the affective computing literature to ground deployments in privacy-by-design and societal value (67).

Limitations

Several limitations should be acknowledged. First, although the sample encompassed participants across a spectrum of depressive severity—including some severe cases—the recruitment was not restricted solely to severe MDD. Diagnostic classification was based on SCID-5 interviews combined with Beck Depression Inventory (BDI) scores above 20, leading to a binary grouping (MDD vs. control). Consequently, we did not model depression severity as a continuous variable or examine stratified performance across severity levels. Future studies should incorporate continuous BDI scores as predictive features to explore whether accounting for symptom intensity enhances model discrimination. While restricting the sample to more severe cases may yield higher classification accuracy, prior work indicates that machine learning models often exhibit lower accuracy when trained on heterogeneous, real-world clinical populations, highlighting the trade-off between performance and ecological generalizability.
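To make the proposed severity-stratified analysis concrete, the sketch below reports classification accuracy within standard BDI-II severity bands rather than as a single pooled figure. It is a hypothetical sketch: the CSV path, column names, and the availability of out-of-sample predictions are assumptions, not part of the present analysis.

```python
# Hypothetical follow-up analysis: accuracy within BDI-II severity bands
# (minimal/mild/moderate/severe) instead of one pooled accuracy.
import pandas as pd
from sklearn.metrics import accuracy_score

df = pd.read_csv("features_with_predictions.csv")  # per-participant rows with
                                                   # 'bdi', 'group', 'pred' columns
bands = pd.cut(df["bdi"], bins=[0, 13, 19, 28, 63], include_lowest=True,
               labels=["minimal", "mild", "moderate", "severe"])

for band in bands.cat.categories:
    mask = bands == band
    if mask.sum() == 0:
        continue
    acc = accuracy_score(df.loc[mask, "group"], df.loc[mask, "pred"])
    print(f"{band}: n={int(mask.sum())}, accuracy={acc:.2f}")
```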

Second, this single-site study enrolled 85 participants—a sample size comparable to prior depression-recognition work yet still modest for high-dimensional multimodal analyses. To establish robustness and transportability, external validation in independent, demographically diverse cohorts is essential; multi-center external validation studies are planned.

Third, our speech features were restricted to basic linguistic and timing measures (e.g., word count, response latency, silence ratio), which may overlook relevant acoustic–prosodic cues. Future studies could incorporate F0/pitch, intensity, jitter–shimmer, MFCCs, spectral/temporal prosody, and turn-taking dynamics to potentially enhance sensitivity and interpretability.
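As a pointer to what such an extension might look like, the following sketch computes a few of these acoustic-prosodic descriptors with librosa from a hypothetical per-segment WAV file. Jitter and shimmer are omitted because they are more commonly computed with dedicated tools such as Praat; all file and feature names here are illustrative rather than part of the study's pipeline.

```python
# Minimal sketch (hypothetical, not the study's pipeline) of segment-level
# acoustic-prosodic features: F0 statistics, voiced ratio, RMS intensity, MFCCs.
import numpy as np
import librosa


def prosodic_features(wav_path):
    y, sr = librosa.load(wav_path, sr=None, mono=True)

    # Fundamental frequency via probabilistic YIN; NaN marks unvoiced frames.
    f0, voiced_flag, _ = librosa.pyin(
        y, fmin=librosa.note_to_hz("C2"), fmax=librosa.note_to_hz("C7"), sr=sr
    )
    f0_voiced = f0[~np.isnan(f0)]

    rms = librosa.feature.rms(y=y)[0]               # frame-wise intensity proxy
    mfcc = librosa.feature.mfcc(y=y, sr=sr, n_mfcc=13)

    return {
        "f0_mean": float(np.mean(f0_voiced)) if f0_voiced.size else float("nan"),
        "f0_std": float(np.std(f0_voiced)) if f0_voiced.size else float("nan"),
        "voiced_ratio": float(np.mean(voiced_flag)),
        "rms_mean": float(np.mean(rms)),
        "mfcc_means": mfcc.mean(axis=1).tolist(),   # 13 per-segment averages
    }


print(prosodic_features("segment_01.wav"))
```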

In addition, the wearable POV setup, central to the study’s ecological validity, introduces variability related to participant motion, lighting conditions, and camera alignment, which may have added minor measurement noise despite averaging across four 30-second conversational segments. Finally, some automatically extracted features (e.g., gaze and facial action units) depend on computer vision pipelines that are inherently susceptible to tracking errors. Implementing confidence-weighted smoothing, temporal filtering, or hybrid sensing methods in future studies could improve reliability.
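As an illustration of the confidence-weighted smoothing mentioned above, the sketch below applies an exponential moving average to a noisy frame-wise signal, letting each frame's detector confidence control how strongly it updates the running estimate. This is a sketch under assumed inputs, not the implemented pipeline.

```python
# Hypothetical confidence-weighted exponential smoothing for a noisy
# frame-wise signal (e.g., gaze yaw): low-confidence frames contribute less.
import numpy as np


def confidence_weighted_ema(values, confidences, alpha=0.3):
    """values: frame-wise measurements; confidences: detector confidence in [0, 1]."""
    values = np.asarray(values, dtype=float)
    confidences = np.clip(np.asarray(confidences, dtype=float), 0.0, 1.0)
    smoothed = np.empty_like(values)
    smoothed[0] = values[0]
    for t in range(1, len(values)):
        gain = alpha * confidences[t]   # trust this frame in proportion to confidence
        smoothed[t] = gain * values[t] + (1.0 - gain) * smoothed[t - 1]
    return smoothed


yaw = [2.0, 2.1, 35.0, 2.2, 2.0]        # a tracking glitch at frame 2
conf = [0.9, 0.9, 0.1, 0.9, 0.9]        # the glitch has low detector confidence
print(confidence_weighted_ema(yaw, conf).round(2))  # glitch is heavily damped
```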

For real-world scalability, future work could (a) pilot a mobile-health workflow that ships a lightweight POV device to participants (with consented upload via a secure app) to replicate our features in natural home settings, and (b) develop a telepsychiatry-compatible variant using a brief left–center–right gaze and affect calibration plus domain adaptation to handle off-axis views and lighting. Such studies would help ensure generalizability and calibration across settings, while accelerating privacy-preserving, real-world deployment.
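To make the proposed left–center–right calibration concrete, the sketch below records yaw angles while a participant briefly fixates left, center, and right targets, then derives per-person decision boundaries for labeling subsequent frames. The yaw sign convention, thresholds, and numbers are assumptions for illustration only.

```python
# Hypothetical left-center-right gaze calibration: estimate per-person yaw
# boundaries from three short fixation recordings, then label new frames.
import numpy as np


def calibrate(yaw_left, yaw_center, yaw_right):
    """Each argument: yaw angles (degrees) recorded while fixating one target."""
    left = np.median(yaw_left)
    center = np.median(yaw_center)
    right = np.median(yaw_right)
    # Decision boundaries halfway between neighboring calibration points.
    return (left + center) / 2.0, (center + right) / 2.0


def classify(yaw, boundaries):
    lo, hi = boundaries
    if yaw < lo:
        return "left"
    if yaw > hi:
        return "right"
    return "center"


bounds = calibrate([-22, -20, -21], [-1, 0, 1], [19, 21, 20])
print([classify(v, bounds) for v in (-15.0, 0.5, 12.0)])  # ['left', 'center', 'right']
```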

Conclusions

This study demonstrates that multimodal behavioral features—captured through wearable point-of-view (POV) glasses during clinical interviews—can reliably differentiate individuals with major depressive disorder (MDD) from healthy controls. Visual markers, particularly reduced center gaze and diminished positive affect, emerged as robust indicators of depression, aligning with established theories of social withdrawal and affective blunting. Importantly, a machine learning classifier trained on features derived from naturalistic audiovisual recordings achieved high diagnostic accuracy (84.7%) and strong discriminative performance (AUC = 0.89).

These findings underscore the potential of wearable, low-burden technologies to facilitate objective, scalable, and ecologically valid assessments of depression in real-world clinical settings. By capturing authentic clinician–patient interactions, the proposed framework advances the integration of behavioral signal processing and machine learning in psychiatric evaluation.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics statement

The studies involving humans were approved by the Non-Interventional Clinical Research Ethics Committee, Zonguldak Bülent Ecevit University, Zonguldak, Turkey. The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study. Written informed consent was obtained from the individual(s) for the publication of any potentially identifiable images or data included in this article.

Author contributions

HK: Conceptualization, Writing – review & editing, Writing – original draft, Methodology. MÇ: Validation, Writing – review & editing, Formal Analysis, Data curation, Software. VÇ: Investigation, Writing – review & editing, Visualization. HAK: Writing – review & editing, Data curation. EÖ: Writing – review & editing, Data curation. ÇG: Writing – review & editing, Formal Analysis, Validation, Methodology, Software. BÖ: Supervision, Writing – review & editing. NA: Supervision, Writing – review & editing.

Funding

The author(s) declare that no financial support was received for the research and/or publication of this article.

Acknowledgments

The authors would like to thank all participants and their families for their time and involvement in this study. We are also grateful to the institutional staff at Zonguldak Bülent Ecevit University for their assistance with data collection. Additionally, we acknowledge the developers of the open-source software tools utilized in this research, whose contributions made the computational analysis possible. Preliminary results from this work were presented as an oral presentation at the Psychiatric Association of Türkiye Annual Meeting and 3rd International 27th National Clinical Education Symposium (27–30 April 2025).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Generative AI was used in the creation of this manuscript.


Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Abbreviations

AUC, Area Under the Curve; AU, Action Unit; BDI, Beck Depression Inventory; DSM-5, Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition; EAR, Eye Aspect Ratio; FACS, Facial Action Coding System; LOOCV, Leave-One-Out Cross-Validation; MDD, Major Depressive Disorder; ML, Machine Learning; POV, Point of View; RFE, Recursive Feature Elimination; ROC, Receiver Operating Characteristic; SCID-5, Structured Clinical Interview for DSM-5; SPSS, Statistical Package for the Social Sciences; SVM, Support Vector Machine.

References

1. Wu Y, Fan L, Xia F, Zhou Y, Wang H, Feng L, et al. Global, regional, and national time trends in incidence for depressive disorders, from 1990 to 2019: an age-period-cohort analysis for the GBD 2019. Ann Gen Psychiatry. (2024) 23:28. doi: 10.1186/s12991-024-00513-1

2. Marx W, Penninx B, Solmi M, Furukawa T, Firth J, Carvalho A, et al. Major depressive disorder. Nat Rev Dis Primers. (2023) 9:1–21. doi: 10.1038/s41572-023-00454-1

3. Zhang X, Zhang Z, Diao W, Zhou C, Song Y, Wang R, et al. Early-diagnosis of major depressive disorder: From biomarkers to point-of-care testing. TrAC Trends Analytical Chem. (2022) 159:116904. doi: 10.1016/j.trac.2022.116904

4. García-Batista Z, Guerra-Peña K, Cano-Vindel A, Herrera-Martínez S, and Medrano L. Validity and reliability of the Beck Depression Inventory (BDI-II) in general and hospital population of Dominican Republic. PloS One. (2018) 13. doi: 10.1371/journal.pone.0199750

5. Hunt M, Auriemma J, and Cashaw AC. Self-report bias and underreporting of depression on the BDI-II. J Pers Assess. (2003) 80:26–30. doi: 10.1207/S15327752JPA8001_10

6. Gavrilescu M and Vizireanu N. Predicting depression, anxiety, and stress levels from videos using the facial action coding system. Sensors (Basel Switzerland). (2019) 19. doi: 10.3390/s19173693

7. Yang M, Cai C, and Hu B. Clustering based on eye tracking data for depression recognition. IEEE Trans Cogn Dev Syst. (2023) 15:1754–64. doi: 10.1109/TCDS.2022.3223128

8. Islam R and Bae S. FacePsy: an open-source affective mobile sensing system - analyzing facial behavior and head gesture for depression detection in naturalistic settings. Proc ACM Human-Computer Interaction. (2024) 8:1–32. doi: 10.1145/3676505

9. Sloan D, Strauss M, and Wisner K. Diminished response to pleasant stimuli by depressed women. J Abnormal Psychol. (2001) 110:488–93. doi: 10.1037/0021-843X.110.3.488

10. Aloshban N, Esposito A, and Vinciarelli A. What you say or how you say it? Depression detection through joint modeling of linguistic and acoustic aspects of speech. Cogn Comput. (2021) 14:1585–98. doi: 10.1007/s12559-020-09808-3

11. Jiang H, Hu B, Liu Z, Yan L, Wang T, Liu F, et al. Investigation of different speech types and emotions for detecting depression using different classifiers. Speech Commun. (2017) 90:39–46. doi: 10.1016/j.specom.2017.04.001

12. Gumus M, DeSouza D, Xu M, Fidalgo C, Simpson W, and Robin J. Evaluating the utility of daily speech assessments for monitoring depression symptoms. Digital Health. (2023) 9. doi: 10.1177/20552076231180523

13. Bauer J, Gerczuk M, Schindler-Gmelch L, Amiriparian S, Ebert D, Krajewski J, et al. Validation of machine learning-based assessment of major depressive disorder from paralinguistic speech characteristics in routine care. Depression Anxiety. (2024) 2024. doi: 10.1155/2024/9667377

14. Niu M, Jianhua T, Liu B, Huang J, and Lian Z. Multimodal spatiotemporal representation for automatic depression level detection. IEEE Trans Affect Computing. (2023) 14:294–307. doi: 10.1109/TAFFC.2020.3031345

15. Nykoniuk M, Basystiuk O, Shakhovska N, and Melnykova N. Multimodal data fusion for depression detection approach. Computation. (2025) 13:9. doi: 10.3390/computation13010009

16. Uddin MA, Joolee JB, and Sohn K-A. Deep multi-modal network-based automated depression severity estimation. IEEE Trans Affect Computing. (2023) 14:2153–67. doi: 10.1109/TAFFC.2022.3179478

17. Karaoglan A, Turkcapar H, Uslu R, and Kapci E. Beck Depression Inventory II: Evaluation of the psychometric properties and cut-off points in a Turkish adult population. Depression Anxiety. (2008) 25:E104–10. doi: 10.1002/da.20371

18. Currey D and Torous J. Increasing the value of digital phenotyping through reducing missingness: A retrospective review and analysis of prior studies. BMJ Ment Health. (2023) 26:e300718. doi: 10.1136/bmjment-2023-300718

19. Edmunds SR, Rozga A, Li Y, Karp EA, Ibanez LV, Rehg JM, et al. Brief report: Using a point-of-view camera to measure eye gaze in young children with autism spectrum disorder during naturalistic social interactions: A pilot study. J Autism Dev Disord. (2017) 47:898–904. doi: 10.1007/s10803-016-3002-3

20. Movavi Video Editor. Movavi (2023). Available online at: https://www.movavi.com/ (Accessed October 03, 2025).

21. Lin Y, Jiao X, and Zhao L. Detection of 3D human posture based on improved mediapipe. J Comput Commun. (2023) 11:102–21. doi: 10.4236/jcc.2023.112008

22. Güney G, Jansen T, Dill S, Schulz J, Dafotakis M, Antink C, et al. Video-Based Hand Movement Analysis of Parkinson Patients before and after Medication Using High-Frame-Rate Videos and MediaPipe. Sensors (Basel Switzerland). (2022) 22:7992. doi: 10.3390/s22207992

23. Radmehr A, Asgari M, and Masouleh M. Experimental study on the imitation of the human head-and-eye pose using the 3-DOF agile eye parallel robot with ROS and mediapipe framework, in: 2021 9th RSI International Conference on Robotics and Mechatronics (ICRoM), Tehran, Iran, Islamic Republic of. (2021). pp. 472–8. doi: 10.1109/ICRoM54204.2021.9663445

24. Dewi C, Chen R, Jiang X, and Yu H. Adjusting eye aspect ratio for strong eye blink detection based on facial landmarks. PeerJ Comput Sci. (2022) 8:e943. doi: 10.7717/peerj-cs.943

25. Abdelrahman AA, Hempel T, Khalifa A, Al-Hamadi A, and Dinges L. (2023). L2CS-net: fine-grained gaze estimation in unconstrained environments, in: 2023 8th International Conference on Frontiers of Signal Processing (ICFSP), Corfu, Greece. pp. 98–102. doi: 10.1109/ICFSP59764.2023.10372944

26. Ekman P and Friesen WV. Facial action coding system. Environ Psychol Nonverbal Behavior. (1978). doi: 10.1037/t27734-000

27. Cheong JH, Jolly E, Xie T, Byrne S, Kenney M, Chang LJ, et al. Py-feat: python facial expression analysis toolbox. Affec Sci. (2023) 4:781–96. doi: 10.1007/s42761-023-00191-4

28. Ahn YA, Moffitt JM, Tao Y, Custode S, Parlade M, Beaumont A, et al. Objective measurement of social gaze and smile behaviors in children with suspected autism spectrum disorder during administration of the autism diagnostic observation schedule, 2nd edition. J Autism Dev Disord. (2024) 54:2124–37. doi: 10.1007/s10803-023-05990-z

29. Gokcimen T, Das B, and Das R. (2024). Evaluating the performance of turkish automatic speech recognition using the generative AI-based whisper model, in: 2024 9th International Conference on Computer Science and Engineering (UBMK), Antalya, Turkiye. pp. 121–5. doi: 10.1109/UBMK63289.2024.10773523

30. Geurts P, Ernst D, and Wehenkel L. Extremely randomized trees. Mach Learning. (2006) 63:3–42. doi: 10.1007/s10994-006-6226-1

31. Hanley JA and McNeil BJ. The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology. (1982) 143:29–36. doi: 10.1148/radiology.143.1.7063747

32. Suslow T, Hoepfel D, Kersting A, and Bodenschatz C. Depressive symptoms and visual attention to others’ eyes in healthy individuals. BMC Psychiatry. (2024) 24:184. doi: 10.1186/s12888-024-05633-2

33. LeMoult J, Kircanski K, Prasad G, and Gotlib I. Negative self-referential processing predicts the recurrence of major depressive episodes. Clin psychol Sci. (2017) 5:174–81. doi: 10.1177/2167702616654898

34. Monéger J, Harika-Germaneau G, Jaafari N, Doolub D, Warck L, Selimbegović L, et al. Depressive self-focus bias following failure: an eye-tracking study among individuals with clinical depression. Front Psychiatry. (2024) 15:1459831. doi: 10.3389/fpsyt.2024.1459831

35. Harper RG, Wiens AN, and Matarazzo JD. Nonverbal communication: the state of the art. New York: Wiley (1978).

36. Kleinke CL. Gaze and eye contact: a research review. Psychol Bull. (1986) 100:78–100. doi: 10.1037/0033-2909.100.1.78

37. Cary MS. The role of gaze in the initiation of conversation. Soc Psychol. (1978) 41:269–71. doi: 10.2307/3033565

38. Kendon A. Some functions of gaze-direction in social interaction. Acta Psychol. (1967) 26:22–63. doi: 10.1016/0001-6918(67)90005-4

39. Robson KS. The role of eye-to-eye contact in maternal-infant attachment. J Child Psychol Psychiatry. (1967) 8:13–25. doi: 10.1111/j.1469-7610.1967.tb02176.x

40. Batki A, Baron-Cohen S, Wheelwright S, Connellan J, and Ahluwalia J. Is there an innate gaze module? Evidence from human neonates. Infant Behav Dev. (2000) 23:223–9. doi: 10.1016/S0163-6383(01)00037-6

41. Alghowinem S, Göcke R, Wagner M, Parker G, and Breakspear M. “Head Pose and Movement Analysis as an Indicator of Depression,” In: 2013 Humaine Association Conference on Affective Computing and Intelligent Interaction, Geneva, Switzerland. (2013), pp. 283–8. doi: 10.1109/ACII.2013.53

42. Ponder E and Kennedy W. On the act of blinking. Exp Physiol. (1927) 18:89–110. doi: 10.1113/EXPPHYSIOL.1927.SP000433

43. Maffei A and Angrilli A. Spontaneous blink rate as an index of attention and emotion during film clips viewing. Physiol Behav. (2019) 204:256–63. doi: 10.1016/j.physbeh.2019.02.037

44. Mackintosh J, Kumar R, and Kitamura T. Blink rate in psychiatric illness. Br J Psychiatry. (1983) 143:55–7. doi: 10.1192/bjp.143.1.55

45. Lee YT, Chang YH, Tsai HJ, Chao SP, Chen DYT, Chen JT, et al. Altered pupil light/darkness reflex and eye-blink responses in late-life depression. BMC Geriatrics. (2024) 24:545. doi: 10.1186/s12877-024-05034-w

46. Dadiz B and Regla AI. A convolutional neural network study on depression and eye blink analysis. Int J Innovative Tech Exploring Eng. (2023) 12:7–11. doi: 10.35940/ijitee.e9488.0412523

47. Ramalho D, Constantino P, Da Silva HP, Constante M, and Sanches J. An augmented teleconsultation platform for depressive disorders. IEEE Access. (2022) 10:130563–71. doi: 10.1109/ACCESS.2022.3228324

48. Gehricke J and Shapiro D. Reduced facial expression and social context in major depression. Psychiatry Res. (2000) 95:157–67. doi: 10.1016/S0165-1781(00)00168-2

49. Fu G, Yu Y, Ye J, Zheng Y, Li W, Cui N, et al. A method for diagnosing depression: facial expression mimicry evaluated by facial expression recognition. J Affect Disord. (2022) 323:809–18. doi: 10.1016/j.jad.2022.12.029

50. Kim H, Kwak S, Yoo SY, Lee EC, Park S, Ko H, et al. Facial expressions track depressive symptoms in old age. Sensors (Basel). (2023) 23:7080. doi: 10.3390/s23167080

51. Scherer S, Stratou G, Lucas G, Mahmoud M, Boberg J, Gratch J, et al. Automatic audiovisual behavior descriptors for psychological disorder analysis. Image Vis Comput. (2014) 32:648–58. doi: 10.1016/j.imavis.2014.06.001

52. Fisher H, Reiss PT, Atias D, Malka M, Shahar B, Shamay-Tsoory S, et al. Facing emotions: Between- and within-session changes in facial expression during treatment for depression. Clin Psychol Sci. (2023) 12:840–54. doi: 10.1177/21677026231195793

53. Girard J, Cohn J, Mahoor M, Mavadati S, and Rosenwald D. “Social risk and depression: Evidence from manual and automatic facial expression analysis,” In: 2013 10th IEEE International Conference and Workshops on Automatic Face and Gesture Recognition (FG), Shanghai, China. (2013) pp. 1–8. doi: 10.1109/FG.2013.6553748

54. Lee T, Baek S, Lee J, Chung ES, Yun K, Kim TS, et al. A deep learning–driven simulation analysis of emotional profiles of depression based on facial expression dynamics. Clin Psychopharmacol Neurosci. (2023) 22:87–94. doi: 10.9758/cpn.23.1059

55. Wang Q, Yang H, and Yu Y. Facial expression video analysis for depression detection in Chinese patients. J Vis Commun Image Represent. (2018) 57:228–33. doi: 10.1016/j.jvcir.2018.11.003

56. Maina G, Adami M, Ascione G, Bondi E, De Berardis D, Delmonte D, et al. Nationwide consensus on the clinical management of treatment-resistant depression in Italy: a Delphi panel. Ann Gen Psychiatry. (2023) 22:48. doi: 10.1186/s12991-023-00478-7

57. d’Andrea G, Cavallotto C, Pettorruso M, Di Lorenzo G, Carullo R, De Berardis D, et al. Effectiveness of repeated Esketamine nasal spray administration on anhedonic symptoms in treatment-resistant bipolar and unipolar depression: A secondary analysis from the REAL-ESK study group. Psychiatry Res. (2025), 116655. doi: 10.1016/j.psychres.2025.116655

58. Martiadis V, Pessina E, Cattaneo CI, Martini A, Raffone F, Prodi T, et al. Efficacy and tolerability of vortioxetine monotherapy in SSRI-resistant OCD: a retrospective multicenter study. Front Psychiatry. (2025) 16:1617345. doi: 10.3389/fpsyt.2025.1617345

59. Akiskal HS, Akiskal KK, Haykal RF, Manning JS, and Connor PD. TEMPS-A: progress towards validation of a self-rated clinical version of the Temperament Evaluation of the Memphis, Pisa, Paris, and San Diego Autoquestionnaire. J Affect Disord. (2005) 85:3–16. doi: 10.1016/j.jad.2004.12.001

60. Favaretto E, Bedani F, Brancati GE, De Berardis D, Giovannini S, Scarcella L, et al. Synthesising 30 years of clinical experience and scientific insight on affective temperaments in psychiatric disorders: State of the art. J Affect Disord. (2024) 362:406–15. doi: 10.1016/j.jad.2024.07.011

61. Kacem A, Hammal Z, Daoudi M, and Cohn J. “Detecting Depression Severity by Interpretable Representations of Motion Dynamics,” In: 2018 13th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2018), Xi'an, China. (2018) pp. 739–45. doi: 10.1109/FG.2018.00116

62. Cummins N, Dineley J, Conde P, Matcham F, Siddi S, Lamers F, et al. Multilingual markers of depression in remotely collected speech samples: A preliminary analysis. J Affect Disord. (2023) 341:128–36. doi: 10.1016/j.jad.2023.08.097

63. Yamamoto M, Takamiya A, Sawada K, Yoshimura M, Kitazawa M, Liang KC, et al. Timing-related speech features and depression severity. PloS One. (2020) 15. doi: 10.1371/journal.pone.0238726

64. Eder J, Dong MS, Wohler M, Simon MS, Glocker C, Pfeiffer L, et al. A multimodal approach to depression diagnosis in primary care: ML development insights. Eur Arch Psychiatry Clin Neurosci. (2025). doi: 10.1007/s00406-025-01990-5

65. Jin N, Ye R, and Li P. Diagnosis of depression based on facial multimodal data. Front Psychiatry. (2025) 16:1508772. doi: 10.3389/fpsyt.2025.1508772

66. Ho CS, Chan YL, Tan TW, Tay GW, and Tang TB. Improving diagnostic accuracy for major depressive disorder using ML with NIRS data. J Psychiatr Res. (2022) 147:194–202. doi: 10.1016/j.jpsychires.2022.01.026

67. Gratch J, Greene G, Picard R, Urquhart L, and Valstar M. Guest editorial: Ethics in affective computing. IEEE Trans Affect Computing. (2024) 15:1–3. doi: 10.1109/TAFFC.2023.3322918

Keywords: major depressive disorder, machine learning, multimodal analysis, wearable technology, point-of-view glasses, artificial intelligence, computer vision

Citation: Kayış H, Çelik M, Çakır Kardeş V, Karabulut HA, Özkan E, Gedizlioğlu Ç, Özbaran B and Atasoy N (2025) A novel approach to depression detection using POV glasses and machine learning for multimodal analysis. Front. Psychiatry 16:1720990. doi: 10.3389/fpsyt.2025.1720990

Received: 08 October 2025; Accepted: 22 October 2025;
Published: 10 November 2025.

Edited by:

Luca Steardo Jr, University Magna Graecia of Catanzaro, Italy

Reviewed by:

Fabiola Raffone, Asl Napoli 1 Centro, Italy
Stefania Landi, ASL Salerno, Italy

Copyright © 2025 Kayış, Çelik, Çakır Kardeş, Karabulut, Özkan, Gedizlioğlu, Özbaran and Atasoy. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Hakan Kayış, drhakankayis@gmail.com
