Skip to main content

ORIGINAL RESEARCH article

Front. Bioeng. Biotechnol.
Sec. Biomechanics
Volume 12 - 2024 | doi: 10.3389/fbioe.2024.1392513

Tongue Feature Recognition to Monitor Rehabilitation: Deep Neural Network with Visual Attention Mechanism Provisionally Accepted

Zhengheng Yi1, 2, 3 Xinsheng Lai2, 3 Aining Sun4  Senlin Fang5*
  • 1Shenzhen Bao'an Fuyong People's Hospital, China
  • 2Guangzhou University of Chinese Medicine, China
  • 3National Famous Traditional Chinese Medicine Expert LAI Xin-sheng Inheritance Studio, China
  • 4Guangdong Zhengyuanchun Traditional Chinese Medicine Clinic Co., Ltd., China
  • 5City University of Macau, Macao, SAR China

The final, formatted version of the article will be published soon.

Receive an email when it is updated
You just subscribed to receive the final version of the article

We endeavor to develop a novel deep learning architecture tailored specifically for the analysis and classification of tongue features, including color, shape, and coating. Unlike conventional methods based on architectures like VGG or ResNet, our proposed method aims to address the challenges arising from their extensive size, thereby mitigating the overfitting problem. Through this research, we aim to contribute to the advancement of techniques in tongue feature recognition, ultimately leading to more precise diagnoses and better patient rehabilitation in Traditional Chinese Medicine (TCM) .In this study, we introduce TGANet (Tongue Feature Attention Network) to enhance model performance. TGANet utilizes the initial five convolutional blocks of pre-trained VGG16 as the backbone and integrates an attention mechanism into this backbone. The integration of the attention mechanism aims to mimic human cognitive attention, emphasizing model weights on pivotal regions of the image. During the learning process, the allocation of attention weights facilitates the interpretation of causal relationships in the model's decision-making.Experimental results demonstrate that TGANet outperforms baseline models, including VGG16, ResNet18, and TSC-WNet, in terms of accuracy, precision, F1 score, and AUC metrics.Additionally, TGANet provides a more intuitive and meaningful understanding of tongue feature classification models through the visualization of attention weights.In conclusion, TGANet presents an effective approach to tongue feature classification, addressing challenges associated with model size and overfitting. By leveraging the attention mechanism and pre-trained VGG16 backbone, TGANet achieves superior performance metrics and enhances the interpretability of the model's decision-making process. The visualization of attention weights contributes to a more intuitive understanding of the classification process, making TGANet a promising tool in tongue diagnosis and rehabilitation.

Keywords: Traditional Chinese Medicine, Tongue Feature Recognition, Deep neural network, attention mechanism, Rehabilitation

Received: 27 Feb 2024; Accepted: 16 Apr 2024.

Copyright: © 2024 Yi, Lai, Sun and Fang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Dr. Senlin Fang, City University of Macau, Macao, Macao, SAR China