Skip to main content

ORIGINAL RESEARCH article

Front. Neurosci.
Sec. Visual Neuroscience
Volume 18 - 2024 | doi: 10.3389/fnins.2024.1368733
This article is part of the Research Topic The Operationalization of Cognitive Systems in the Comprehension of Visual Structures View all articles

Enhancing Visual Communication through Representation Learning

Provisionally accepted
YuHan Wei YuHan Wei *Changwook Lee Changwook Lee Seokwon Han Seokwon Han Anna Kim Anna Kim
  • Dankook University, Yongin, Republic of Korea

The final, formatted version of the article will be published soon.

    This research aims to address the challenges in model construction for the Extended Mind for the Design of the Human Environment. Specifically, we employ the ResNet-50, LSTM, and Object Tracking Algorithms approaches to achieve collaborative construction of high-quality virtual assets, image optimization, and intelligent agents, providing users with a virtual universe experience in the context of visual communication. Firstly, we utilize ResNet-50 as a convolutional neural network model for generating virtual assets, including objects, characters, and environments. By training and fine-tuning ResNet-50, we can generate virtual elements with high realism and rich diversity. Next, we use LSTM (Long Short-Term Memory) for image processing and analysis of the generated virtual assets. LSTM can capture contextual information in image sequences and extract/improve the details and appearance of the images. By applying LSTM, we further enhance the quality and realism of the generated virtual assets. Finally, we adopt Object Tracking Algorithms to track and analyze the movement and behavior of virtual entities within the virtual environment. Object Tracking Algorithms enable us to accurately track the positions and trajectories of objects, characters, and other elements, allowing for realistic interactions and dynamic responses. By integrating the technologies of ResNet-50, LSTM, and Object Tracking Algorithms, we can generate realistic virtual assets, optimize image details, track and analyze virtual entities, and train intelligent agents, providing users with a more immersive and interactive visual communication-driven metaverse experience. These innovative solutions have important applications in the Extended Mind for the Design of the Human Environment, enabling the creation of more realistic and interactive virtual worlds.

    Keywords: human environment, Network Science, Neurology, visual communication, extended mind, Resnet-50, LSTM, Object Tracking Algorithms

    Received: 26 Jan 2024; Accepted: 17 Apr 2024.

    Copyright: © 2024 Wei, Lee, Han and Kim. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

    * Correspondence: YuHan Wei, Dankook University, Yongin, Republic of Korea

    Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.