ORIGINAL RESEARCH article
Front. Artif. Intell.
Sec. Medicine and Public Health
Volume 8 - 2025 | doi: 10.3389/frai.2025.1681759
Real-Time Sleep Disorder Monitoring Design Using Dynamic Temporal Graphs with Facial and Acoustic Feature Fusion
Provisionally accepted- 1University of Shanghai for Science and Technology, Shanghai, China
- 2Shanghai Shidong Hospital of Yangpu District, Shanghai, China
- 3Tongji University, Shanghai, China
Select one of your emails
You have multiple emails registered with Frontiers:
Notify me on publication
Please enter your email address:
If you already have an account, please login
You don't have a Frontiers account ? You can register here
Sleep disorders and related health complications pose significant risks to patient safety and wellbeing. This study presents a novel multimodal approach for identifying and predicting pathological conditions during sleep by analyzing facial expressions and sleep audio signals. We propose a dynamic graph modeling framework that captures the temporal evolution of these multimodal features over time t, enabling real-time monitoring and early warning of sleep-related medical events. Facial expression data is processed using deep convolutional neural networks to extract emotional and physiological indicators, while audio signals are analyzed for breathing patterns, sleep apnea events, and other acoustic biomarkers. These heterogeneous data streams are integrated into a unified dynamic graph representation where nodes represent different modalities and temporal states, and edges capture their interdependencies across time. The temporal dynamics are modeled using graph neural networks that can effectively learn complex spatiotemporal patterns indicative of various sleep pathologies. Our experimental validation demonstrates the system's capability to accurately detect and predict critical sleep events, including sleep apnea, restless leg syndrome, and cardiovascular irregularities during sleep. This multimodal framework addresses a critical gap in sleep medicine by providing continuous, non-invasive monitoring that maintains diagnostic accuracy comparable to polysomnography while eliminating patient discomfort and laboratory constraints. The system's rapid detection capabilities (average 10.7-second delay) and robust clinical agreement rates (94.6%) position it as a viable tool for both point-of-care screening and long-term home-based sleep disorder management, potentially expanding access to sleep medicine services for underserved populations.
Keywords: Sleep Disorder Detection, Facial expression analysis, real-time health monitoring, multimodal learning, machine learning
Received: 11 Aug 2025; Accepted: 21 Oct 2025.
Copyright: © 2025 Pei, Zhou, Fu and Zhou. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
* Correspondence: Hong Zhou, zh720828@126.com
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.