ORIGINAL RESEARCH article
Front. Neuroinform.
This article is part of the Research TopicMultimodal Brain Data Integration and Computational ModelingView all 4 articles
Cross Modal Privacy-Preserving Synthesis and Mixture of Experts Ensemble for Robust ASD Prediction
Provisionally accepted- 1Bannari Amman Institute of Technology Department of Computer Science and Engineering, Sathyamangalam, India
- 2Christ the King Engineering College, Coimbatore, India
Select one of your emails
You have multiple emails registered with Frontiers:
Notify me on publication
Please enter your email address:
If you already have an account, please login
You don't have a Frontiers account ? You can register here
Abstract Introduction: Autism Spectrum Disorder (ASD) diagnosis remains complex due to limited access to large-scale multimodal datasets and privacy concerns surrounding clinical data. Traditional methods rely heavily on resource-intensive clinical assessments and are constrained by unimodal or non-adaptive learning models. To address these limitations, this study introduces AutismSynthGen, a privacy-preserving framework for synthesizing multimodal ASD data and enhancing prediction accuracy. Methods: The proposed system integrates a Multimodal Autism Data Synthesis Network (MADSN), which employs transformer-based encoders and cross-modal attention within a conditional GAN to generate synthetic data across structural MRI, EEG, behavioural vectors, and severity scores. Differential privacy is enforced via DP-SGD (ε ≤ 1.0). A complementary Adaptive Multimodal Ensemble Learning (AMEL) module, consisting of five heterogeneous experts and a gating network, is trained on both real and synthetic data. Evaluation is conducted on ABIDE, NDAR, and SSC datasets using metrics such as AUC, F1 score, MMD, KS statistic, and BLEU. Results: Synthetic augmentation improved model performance with validation AUC gains of ≥ 0.04. AMEL achieved AUC = 0.98 and F1 = 0.99 on real data, and approached near-perfect internal performance (AUC ≈ 1.00, F1 ≈ 1.00) when synthetic data were included. Distributional metrics (MMD = 0.04; KS = 0.03) and text similarity (BLEU = 0.70) demonstrated high fidelity between real and synthetic samples. Ablation studies confirmed the importance of cross-modal attention and entropy-regularized expert gating. Discussion: AutismSynthGen offers a scalable, privacy-compliant solution for augmenting limited multimodal datasets and enhancing ASD prediction. Future directions include semi-supervised learning, explainable AI for clinical trust, and deployment in federated environments to broaden accessibility while maintaining privacy.
Keywords: Autism Spectrum Disorder, multimodal data synthesis, Differential privacy, generative adversarial network, ensemble learning, transformer, Mixture of experts
Received: 04 Aug 2025; Accepted: 28 Oct 2025.
Copyright: © 2025 M and J. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
* Correspondence: Karthiga M, mkarthiga22@gmail.com
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.
