Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Artif. Intell.

Sec. Medicine and Public Health

Volume 8 - 2025 | doi: 10.3389/frai.2025.1681759

Real-Time Sleep Disorder Monitoring Design Using Dynamic Temporal Graphs with Facial and Acoustic Feature Fusion

Provisionally accepted
Fei  PeiFei Pei1Ying  ZhouYing Zhou2Qiangqiang  FuQiangqiang Fu3Hong  ZhouHong Zhou2*
  • 1University of Shanghai for Science and Technology, Shanghai, China
  • 2Shanghai Shidong Hospital of Yangpu District, Shanghai, China
  • 3Tongji University, Shanghai, China

The final, formatted version of the article will be published soon.

Sleep disorders and related health complications pose significant risks to patient safety and wellbeing. This study presents a novel multimodal approach for identifying and predicting pathological conditions during sleep by analyzing facial expressions and sleep audio signals. We propose a dynamic graph modeling framework that captures the temporal evolution of these multimodal features over time t, enabling real-time monitoring and early warning of sleep-related medical events. Facial expression data is processed using deep convolutional neural networks to extract emotional and physiological indicators, while audio signals are analyzed for breathing patterns, sleep apnea events, and other acoustic biomarkers. These heterogeneous data streams are integrated into a unified dynamic graph representation where nodes represent different modalities and temporal states, and edges capture their interdependencies across time. The temporal dynamics are modeled using graph neural networks that can effectively learn complex spatiotemporal patterns indicative of various sleep pathologies. Our experimental validation demonstrates the system's capability to accurately detect and predict critical sleep events, including sleep apnea, restless leg syndrome, and cardiovascular irregularities during sleep. This multimodal framework addresses a critical gap in sleep medicine by providing continuous, non-invasive monitoring that maintains diagnostic accuracy comparable to polysomnography while eliminating patient discomfort and laboratory constraints. The system's rapid detection capabilities (average 10.7-second delay) and robust clinical agreement rates (94.6%) position it as a viable tool for both point-of-care screening and long-term home-based sleep disorder management, potentially expanding access to sleep medicine services for underserved populations.

Keywords: Sleep Disorder Detection, Facial expression analysis, real-time health monitoring, multimodal learning, machine learning

Received: 11 Aug 2025; Accepted: 21 Oct 2025.

Copyright: © 2025 Pei, Zhou, Fu and Zhou. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Hong Zhou, zh720828@126.com

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.