Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Appl. Math. Stat.

Sec. Optimization

Volume 11 - 2025 | doi: 10.3389/fams.2025.1628652

This article is part of the Research TopicMathematical Optimization for Decision Support Systems: Practices and Strategies for Sustainable Supply Chain ManagementView all articles

Accounting Data Anomaly Detection and Prediction Based on Self-supervised Learning

Provisionally accepted
Yingying  ZhangYingying Zhang1*Bingbing  DuanBingbing Duan2
  • 1Chengdu College of Arts and Sciences, Chengdu, China
  • 2Chengdu Huawei Technologies Co., Ltd, Chengdu, China

The final, formatted version of the article will be published soon.

This study proposes a Hierarchical Fusion Self-Supervised Learning (HFSL) framework to address the challenge of scarce labeled data in accounting anomaly detection, integrating domain knowledge with advanced deep learning techniques.Based on financial data from Chinese listed companies in the CSMAR database spanning 2000-2020, this framework integrates temporal contrastive learning, a dualchannel LSTM autoencoder structure, and financial domain knowledge to construct a three-tier cascaded detection system. Empirical research demonstrates that the HFSL framework achieves a precision of 0.836, recall of 0.805, and F1 score of 0.820 in accounting anomaly detection, significantly outperforming traditional methods. In terms of practical metrics, the framework attains an early detection rate of 0.726 while maintaining a false alarm rate of just 0.068, providing technical support for early risk warning. Financial feature contribution analysis reveals that core indicators such as Return on Assets (ROA), Return on Equity (ROE), and their interaction effects play crucial roles in anomaly identification. Through analysis of 2,150 samples in the test set, the study identifies five typical financial fraud patterns (revenue inflation 38.6%, expense concealment 21.7%, asset overvaluation 17.4%, liability understatement 15.2%, and composite manipulation 7.1%) and their temporal evolution characteristics.The research also finds that financial anomalies typically exhibit three evolutionary patterns: progressive deterioration (64%), sudden anomalies (22%), or cyclical fluctuations (15%), providing empirical evidence for regulatory practice. This study applies self-supervised learning to accounting anomaly detection, not only solving the detection challenges in unlabeled data scenarios but also providing effective tools for financial supervision and risk management.

Keywords: Accounting data, anomaly detection, Financial fraud, hierarchical fusion framework, Self-supervised learning

Received: 14 May 2025; Accepted: 18 Aug 2025.

Copyright: © 2025 Zhang and Duan. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Yingying Zhang, Chengdu College of Arts and Sciences, Chengdu, China

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.