Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Comput. Sci.

Sec. Computer Vision

This article is part of the Research TopicLarge Tensor Analysis and ApplicationsView all 5 articles

Towards Real-Time Emotion Recognition in Fog Computing-Based Systems: Leveraging Interpretable PCA_CNN, YOLO with Self-Attention Mechanism

Provisionally accepted
Nora  El RashidyNora El Rashidy1*Eman  AllogmaniEman Allogmani2Esraa  HassanEsraa Hassan1Hela  ElmannaiHela Elmannai3Zainab  H. AliZainab H. Ali1
  • 1Kafrelsheikh University, Kafr el-Sheikh, Egypt
  • 2Majmaah University College of Engineering, Al Majmaah, Saudi Arabia
  • 3Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia

The final, formatted version of the article will be published soon.

ABSTRACT Emotion estimation from face expression analysis has been extensively examined in computer science. In contrast, classifying expressions depends on appropriate facial features and their dynamics. Despite the promising accuracy results in handled and favorable conditions, processing faces acquired at a distance, entailing low-quality images, still needs an influential performance reduction. The primary objective of this study is to introduce a Real-Time Emotion Recognition system-based Fog Technique, which was developed to track and observe human emotional states in real time.This paper provides a comprehensive integration of PCA-based feature selection with a specific version of YOLO (YOLOv8), in addition to spatial attention for real-time recognition. The developed system demonstrates superiority in edge deployment capabilities compared to existing approaches. The proposed model is compared with the EL Rashidy et al. CNN_PCA hybrid model. First, Principal Component Analysis (PCA) is employed as a dimension-reduction tool, focusing on the most informative characteristics during training, and then CNN as classification layer. The proposed system's performance is assessed via a dataset of 35,888 facial photos classified into seven classes: anger, fear, happiness, neutral, sadness, surprise, and disgust. The constructed model surpasses established pre-trained models, such as VGG, ResNet, and MobileNet, with different evaluation metrics. First, the PCA_CNN model achieved superior accuracy, precision, recall, and Area Under the Curve (AUC) scores of 0.936, 0.971, 0.843, 0.871, and 0.943.YOLO v8 aith attention model achieved 0.986, 0.902, 0.941, and 0.952. Additionally, the model exhibits significantly faster processing time, completing computations in just 610 seconds than other pre-trained models. To validate the model's superiority, extensive testing on additional datasets consistently yields promising performance results, further validating the efficiency and effectiveness of our developed model in real-time emotion recognition for advancing affective computing applications.

Keywords: attention mechanism, Convolutional Neural Network, dimension reduction, emotion recognition, fog computing, YOLO

Received: 27 Sep 2025; Accepted: 25 Nov 2025.

Copyright: © 2025 El Rashidy, Allogmani, Hassan, Elmannai and H. Ali. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Nora El Rashidy

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.