ORIGINAL RESEARCH article
Front. Psychiatry
Sec. Computational Psychiatry
Volume 16 - 2025 | doi: 10.3389/fpsyt.2025.1659536
This article is part of the Research TopicDeep Learning for High-Dimensional Sense, Non-Linear Signal Processing and Intelligent Diagnosis, vol IIView all articles
Multimodal Deep Learning for Non-Linear Signal Interpretation in Psychiatric Diagnostics
Provisionally accepted- Shandong Jiaotong University, Jinan, China
Select one of your emails
You have multiple emails registered with Frontiers:
Notify me on publication
Please enter your email address:
If you already have an account, please login
You don't have a Frontiers account ? You can register here
The increasing complexity of high-dimensional sensory data in psychiatric diagnostics necessitates advanced methodologies capable of modeling nonlinear and heterogeneous signals. Conventional approaches often treat modalities independently, overlooking critical inter-modal relationships and semantic consistencies that are pivotal for accurate and comprehensive mental health assessments. Addressing this challenge, we propose a multimodal deep learning framework tailored for intelligent nonlinear signal interpretation. Central to this framework is the Fusion-Aware Relational Encoder (FARE), an innovative architecture designed to capture high-order interactions among modalities while maintaining their unique characteristics through relational attention mechanisms and adaptive fusion techniques. Additionally, the framework incorporates a Modality-Aligned Optimization Strategy (MAOS) to ensure balanced learning by integrating uncertainty regularization, cross-modal contrastive alignment, and a curriculum-based training methodology. This synergistic combination significantly mitigates issues such as gradient conflicts and modality imbalance, enhancing semantic coherence and model robustness. Extensive empirical evaluations conducted on diverse multimodal psychiatric datasets demonstrate the framework's superior performance in classification accuracy and interpretability, highlighting its potential to advance the field of nonlinear signal processing for intelligent psychiatric diagnostics.
Keywords: Multimodal deep learning, Nonlinear signal processing, Relational attention mechanism, Modality-aligned optimizationstrategy, Intelligent psychiatric diagnostics
Received: 04 Jul 2025; Accepted: 25 Aug 2025.
Copyright: © 2025 Pan. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
* Correspondence: Yixuan Pan, Shandong Jiaotong University, Jinan, China
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.