- Information Center of Guangdong Power Grid Co., Ltd., Guangzhou, China
Introduction: To address the issues of concept drift and scarcity of anomaly samples in real-time anomaly detection under the massive streaming data environment of power dispatching and control systems, this study focuses on developing an effective detection method.
Methods: We propose a streaming data anomaly detection method integrating adaptive isolation forests and self-supervised learning. First, a business-based model is constructed by analyzing inherent relationships between system services, processes, and resource usage. An improved isolation forest algorithm with a sub-forest progressive update mechanism is designed—selectively eliminating sub-detectors with large anomaly rate deviations and dynamically adding new ones to overcome performance degradation from traditional random updates. Additionally, a GPT-based self-supervised learning framework is introduced, incorporating state memory units to encode historical data patterns and a distance metric-based sampling strategy to reduce redundancy.
Results: Experiments on a real power dispatching process resource dataset show the proposed method significantly outperforms the traditional streaming data isolation forest algorithm in key indicators such as AUC value, with the highest improvement reaching 39.12%. Ablation experiments verify the effectiveness of each module.
Discussion: The proposed method enhances the detection algorithm's adaptability to concept drift, overall stability, and ability to perceive hidden anomalies, providing reliable technical support for the safe and stable operation of the power dispatching system.
1 Introduction
As the nerve center of the smart grid, the stable and reliable operation of power dispatching and control systems is directly related to the safety and economic efficiency of the entire power system. With the continued expansion of the power grid, the widespread integration of distributed energy resources, and the deepening of demand-side management, the complexity of power dispatching and control systems is increasing, and the scale and number of functional modules are growing exponentially. In this context, system failures caused by software process anomalies, resource contention, and data jumps are becoming increasingly frequent. These failures are often hidden in massive amounts of real-time operational data, making them difficult to detect and locate in a timely manner using traditional monitoring methods.
Especially in the streaming data environment generated by such systems, data arrives continuously at high speed, may exhibit concept drift, and contains extremely scarce anomalous samples. Traditional anomaly detection methods based on static thresholds set by expert experience are not only highly subjective and poorly adaptable, but also struggle to capture the collaborative and competitive relationships between complex processes and cannot respond effectively to dynamic changes in the data distribution [12]. Therefore, research into an intelligent anomaly detection algorithm that can adaptively update itself and fully exploit the internal temporal patterns and semantic features of streaming data is of crucial theoretical significance and engineering value for achieving early warning of power dispatch control system failures and improving the reliability of power grid operations.
This paper addresses these challenges and aims to investigate novel anomaly detection methods for the streaming data environment of power dispatching and control systems. First, we analyze the inherent connections between business, processes, and resource usage within the system, laying the foundation for anomaly detection. Furthermore, the core contributions of this paper lie in two aspects: First, we propose an adaptive isolation forest-based anomaly detection algorithm for streaming data. This algorithm dynamically maintains a collection of detectors through a novel sub-forest progressive update mechanism, effectively overcoming the model performance degradation caused by random updates in traditional methods, while balancing detection efficiency and adaptability to concept drift. Second, to further mine the rich temporal information contained in streaming data, we introduce a self-supervised learning framework based on the GPT architecture, constructing state memory units to encode historical data patterns, thereby enhancing the model’s understanding of normal behavior patterns and improving its ability to identify hidden anomalies.
Experimental validation on real-world power dispatch process resource data shows that the proposed method significantly outperforms traditional baseline algorithms in overall performance (AUC). Ablation experiments further verify the effectiveness of each innovative module. This research provides new technical ideas and solutions for the safe and stable operation of power dispatch systems.
2 Related work
Based on their core concepts, anomaly detection technologies can be broadly categorized into statistical, distance, density, clustering, and learning-based approaches. Adapting these technologies to streaming data environments and meeting the high real-time and high reliability requirements of power dispatching and control systems remains a key focus and challenge in current research.
2.1 Traditional anomaly detection algorithms and their streaming adaptations
Early anomaly detection mostly relied on statistical methods (such as the 3σ principle and the Grubbs test) or threshold methods based on expert experience. Although these methods are simple and efficient, they cannot capture the complex relationships between multidimensional features, and static thresholds struggle to adapt to the dynamically changing environment of streaming data. To cope with the continuous arrival of streaming data, researchers have proposed many incremental learning algorithms. For example, incremental versions of distance-based algorithms [1] (such as K-NN) require dynamic maintenance of the nearest-neighbor set, incurring huge computational overhead, and density-based algorithms [2] (such as LOF) likewise face the high cost of recalculating neighbor relationships during incremental updates. These methods struggle to meet the stringent detection-efficiency requirements of power dispatch scenarios.
2.2 Isolation forest algorithm and its application to streaming data
The Isolation Forest (iForest) algorithm has significant advantages in processing high-dimensional big data due to its linear time complexity and the fact that it requires no distance calculation. It isolates samples through a random segmentation strategy, and outliers exhibit shorter path lengths because they are easy to isolate. The classic Isolation Forest algorithm is designed for offline batch data, and applying it directly to streaming data leads to performance degradation under concept drift. To this end, Ref. [3] proposed a streaming data Isolation Forest algorithm that achieves incremental model updates by randomly replacing isolation trees. However, this random update strategy lacks specificity and may unintentionally discard subtrees that perform well, leading to performance fluctuations and a decrease in overall detection accuracy. In contrast, the adaptive update mechanism proposed in this paper addresses this issue by selectively eliminating the sub-detectors with the largest anomaly rate deviations. This targeted approach ensures that only poorly performing sub-detectors are updated, preserving the stability and integrity of the remaining forest and thus maintaining the model's overall performance more intelligently and consistently.
2.3 Anomaly detection methods based on deep learning
In recent years, deep learning techniques have been widely used in anomaly detection [4], especially autoencoders and generative adversarial networks (GANs), which identify anomalies by reconstructing errors. These methods are effective in detecting complex patterns in data. However, these methods typically require a large amount of data for training and are computationally demanding. More importantly, they mostly focus on learning spatial features while ignoring the crucial temporal dependencies in streaming data.
2.4 The rise of self-supervised learning in time series anomaly detection
Self-supervised learning [5] provides a new paradigm for addressing the scarcity of abnormal samples by designing pre-training tasks that learn representations from unlabeled data. In natural language processing (NLP) in particular, models such as GPT (Generative Pre-trained Transformer) have learned rich language representations on large-scale corpora through the pre-training task of “predicting the next word.” Inspired by this, researchers have begun applying similar ideas to time series data, learning the temporal evolution patterns of normal data by “predicting future data points.” To our knowledge, no existing work combines this framework with efficient traditional anomaly detection algorithms (such as isolation forests) in power dispatching streaming data scenarios. This paper integrates GPT-style self-supervised learning with an adaptive isolation forest: pre-trained state memory units capture the temporal regularities of historical stream data, and the learned representations are injected into the forest detection process, further improving detection performance.
In summary, based on an in-depth analysis of the shortcomings of the streaming-data isolation forest algorithm, this paper innovates along two dimensions: the model update strategy and temporal feature mining. It proposes a solution that is efficient, adaptable, and accurate, filling the gap in existing research for the specific application scenario of power dispatching and control systems.
3 System analysis and methods
3.1 Analysis of the relationship between business, process, and process resource occupancy in the power dispatching control system
With the continued expansion of the dispatching network, the addition of new functions, the upgrading of the dispatching system, and the introduction of demand-side management, the power dispatching control system itself has become increasingly complex. Software errors in the system have become increasingly prominent, and failures caused by them have increased accordingly. Building on the constructed knowledge graph of the smart grid control system, we can further explain and analyze the relationships between business, processes, and process resource usage in the power dispatching control system. For example, when process data at a telemetry point for provincial power generation is periodically collected and recorded, a difference between values at adjacent moments greater than a manually set threshold indicates abnormal data and suggests that a data jump failure may have occurred.
As shown in Figure 1, the causes of data jumps fall into two main types: external factors, such as jumps in the actual data transmitted by a remote station, and failures in the local power dispatching and control system, primarily caused by hardware damage and abnormal process operation. Hardware damage is relatively easy to troubleshoot, while failures caused by abnormally running processes are harder to detect in a timely manner. Process abnormalities often lead to abnormal hardware resource usage by processes. In the operation of power dispatching and control systems, services and processes are closely intertwined. Figure 2 shows the flow chart of the telemetry table refresh service. When this service operates normally, the corresponding processes must run in an orderly and coordinated manner, and the resource usage of each process can be collected during this time. Conversely, abnormal operation of a related process can impact the overall service flow, hindering service operations and even causing failures.

Figure 3 illustrates the relationship between services, processes, and process resource usage. Process resource usage reveals the current process operation and thus allows the current service status to be inferred. In power dispatching and control systems, some services are not only directly affected by the resource usage of related processes but also indirectly affected by the server's total resource usage, because processes unrelated to the business can consume excessive server resources, degrade server performance, and thus hinder business operation. In fact, the resource usage of a single process and the total server resource usage stand in a part-whole relationship, but they emphasize different points: the resource usage of a single process mainly reflects the real-time status of the processes closely related to the business and is a direct influencing factor, while the total server resource usage emphasizes the impact of the server's operating state on the business and is an indirect influencing factor.
Figure 3. Schematic diagram of the relationship between business, process and process resource usage.
Currently, methods for determining the operational status of upper-level services based on process resource usage often rely on a single threshold set by expert experience. This approach is highly subjective and fails to reflect either the relationships between the processes associated with the service itself or the impact of processes associated with other services. Online data, however, not only reflects the real-time status of each process during service operation but also implicitly captures the cooperative and competitive relationships between processes. By intelligently learning from historical data on service process resource usage, taking different attributes as input, and defining boundaries in a high-dimensional space that encompass the majority of normal data, one can determine whether the operational status of upper-level services is abnormal; this capability is crucial for maintaining the safe and stable operation of the power grid.
3.2 Anomaly detection algorithm for stream data in power dispatching control systems based on adaptive isolation forest
The Isolation Forest algorithm [5, 6] is a widely influential detection algorithm. Its main idea is to randomly select a sample attribute of a data sample space and split the space on it, obtaining two sub-sample spaces; a sample attribute is then randomly selected to split each sub-sample space in turn, until every sub-sample space contains only a single data point (or identical data points). The partitioning is analogous to building a binary tree, with the root representing the entire sample space and the leaves representing isolated data points. The algorithm is described as follows:
Suppose there is a dataset X described by a binary tree T with a set of nodes N; each node is written $N_{ij}^{l}$ or $N_{ij}^{r}$, where $i$ denotes the level of the tree, $j$ indexes the node from left to right within its level, and $l$ and $r$ distinguish left and right children. In particular, $N_0$ denotes the root node, which contains the entire dataset X.

For the dataset $X_{ij}$ contained in a node at layer $i$, a sample attribute $q$ and a split value $p$ within its value range are randomly selected, dividing $X_{ij}$ into $X_{(i+1)j^{*}}^{l}$ and $X_{(i+1)(j^{*}+1)}^{r}$, corresponding to the nodes $N_{(i+1)j^{*}}^{l}$ and $N_{(i+1)(j^{*}+1)}^{r}$, where $j^{*}$ indexes the node from left to right in layer $i+1$. Data less than or equal to $p$ are placed in $N_{(i+1)j^{*}}^{l}$, and data greater than $p$ are placed in $N_{(i+1)(j^{*}+1)}^{r}$, as shown in Equations 1, 2:

$$X_{(i+1)j^{*}}^{l} = \{\, x \in X_{ij} \mid x_{q} \le p \,\} \quad (1)$$

$$X_{(i+1)(j^{*}+1)}^{r} = \{\, x \in X_{ij} \mid x_{q} > p \,\} \quad (2)$$
When the following situation occurs, a complete binary tree is obtained and the partitioning is completed:
1. The depth of the data tree reaches the set maximum value;
2. Node N contains only one data point or the data points it contains are the same
The above method is different from existing distance-based and density-based anomaly detection methods. This method does not require distance or density calculation and can better meet the needs of high-efficiency analysis and processing of online anomaly detection.
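To make the partitioning concrete, the following minimal Python sketch builds a single isolation tree by recursive random splitting and returns the depth at which a point is isolated; function and variable names are our own illustration, not the authors' implementation.

import numpy as np

def build_itree(X, height, max_height, rng):
    """Recursively split X on a random attribute/value until points isolate."""
    n = X.shape[0]
    if height >= max_height or n <= 1 or np.all(X == X[0]):
        return {"size": n}                      # external node
    q = rng.integers(X.shape[1])                # random attribute
    lo, hi = X[:, q].min(), X[:, q].max()
    if lo == hi:
        return {"size": n}                      # attribute is constant here
    p = rng.uniform(lo, hi)                     # random split value
    mask = X[:, q] <= p                         # <= p goes left, > p goes right
    return {"q": q, "p": p,
            "left": build_itree(X[mask], height + 1, max_height, rng),
            "right": build_itree(X[~mask], height + 1, max_height, rng)}

def path_length(x, node, depth=0):
    """Depth at which x lands in an external node; shorter = more anomalous."""
    if "size" in node:
        return depth
    child = node["left"] if x[node["q"]] <= node["p"] else node["right"]
    return path_length(x, child, depth + 1)

rng = np.random.default_rng(0)
X = rng.random((256, 18))                       # e.g., 18 resource features
tree = build_itree(X, 0, int(np.ceil(np.log2(256))), rng)
print(path_length(X[0], tree))

Averaging such path lengths over many trees, and normalizing, yields the usual isolation forest anomaly score.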
Streaming data from power dispatching and control systems is characterized by high volume, rapid, and continuous arrival. To detect anomalies in this data, real-time updates of anomaly detectors are necessary to ensure stable performance.
Figure 4 provides an overview of the entire streaming anomaly detection pipeline. Incoming data are first organized into a sliding window of length N, which captures the most recent observations required for window-level anomaly analysis. This window is passed to the anomaly scoring module, where the feature extractor computes the anomaly scores and produces the window-level anomaly rate. To handle the non-stationary nature of power system streams and to remain resilient to concept drift, the framework incorporates an adaptive update mechanism driven by two triggers:
1. An anomaly-rate threshold, where the averaged anomaly score exceeds a predefined value, and
2. A buffer-based trigger, where the update buffer becomes full. These triggers ensure that model updates occur only when necessary, balancing computational efficiency and adaptability (a minimal sketch of this trigger logic is given below).
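The following sketch illustrates the two triggers and the corresponding update-set selection described in this section; the threshold and capacity values are placeholders of our choosing, not values from the paper.

def check_triggers(window_anomaly_rate, buffer, anomaly_threshold=0.2,
                   buffer_capacity=500):
    """Return which trigger fired: 'rate', 'buffer', or None."""
    if window_anomaly_rate > anomaly_threshold:
        return "rate"      # possible concept drift or clustered anomalies
    if len(buffer) >= buffer_capacity:
        return "buffer"    # routine refresh once enough samples accumulate
    return None

def select_update_set(trigger, window, buffer):
    """Rate trigger: window plus buffer; buffer trigger: buffer only."""
    return list(window) + list(buffer) if trigger == "rate" else list(buffer)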
Integrating the low complexity and high efficiency of the isolation forest algorithm, a novel isolation forest anomaly detection method based on incremental learning with sub-forest progressive updates is proposed for stream data from power dispatching and control systems. Multiple sub-forest anomaly detectors are constructed from isolation trees trained on historical datasets and combined into a base forest anomaly detector. A sliding window stores the incoming streaming data: whenever new data enters the window, the oldest data is removed. Bernoulli sampling then determines whether each new data point is also stored in the buffer.
The base forest anomaly detector determines the anomaly rate of the sliding window. If the anomaly-rate trigger is activated or the buffer becomes full, the system initiates a detector update following the update mechanism described in Figure 4. When the former triggers an update, the updated dataset is the union of the data in the sliding window and the data in the buffer. The sliding window and buffer are cleared, requiring a new sliding window to be rebuilt. When the latter triggers an update, the corresponding updated dataset is the data in the buffer.
Based on the updated dataset, the absolute deviation of the anomaly rate between the sub-forest detector and the base forest anomaly detector is calculated, and sub-forest anomaly detectors with large deviations are removed. Simultaneously, multiple sub-forest anomaly detectors are created based on the updated dataset and added to the base forest anomaly detector to achieve detector update optimization. The process is shown in Figure 5.
Figure 5. Schematic diagram of the anomaly detection algorithm for stream data in the power dispatching control system based on isolation forest.
This paper proposes an isolation forest-based anomaly detection method for stream data in power dispatching and control systems, which mainly includes five key steps:
Step 1: Sample the data set of the power dispatching control system through a systematic sampling method, construct multiple sub-forest anomaly detectors, and combine the multiple sub-forest anomaly detectors into a base forest anomaly detector.
Specifically, the isolation forest algorithm is used to construct L isolation trees (L being the number of isolation trees) from the original power dispatching dataset, and the systematic sampling method divides these trees into n groups, forming multiple sub-forest anomaly detectors. The base forest anomaly detector is constructed from all the sub-forest detectors as follows. Collect N power dispatching data samples to form the original dataset, and uniformly sample Ψ of them as training samples for each isolation tree. Within each tree's sample, two random selections are performed: a feature is selected at random, and then a value is selected at random from that feature's range. The sample is binarily partitioned, with samples smaller than the value placed on the left side of the node and samples greater than or equal to it on the right, yielding a split condition and left and right datasets. This process is repeated on the left and right datasets until a termination condition is met. There are two termination conditions:
1. The data itself cannot be divided further (it only contains one sample, or all samples are the same);
2. The height of the tree reaches ⌈log₂ Ψ⌉.

In the above method, the isolation trees are divided into groups using the systematic sampling method to construct sub-forest anomaly detectors, recorded as iForest(1)∼iForest(n), where the isolation trees that make up iForest(i) are numbered as shown in Equation 3:

$$\{\, i,\; n+i,\; 2n+i,\; \dots,\; (L/n-1)\,n+i \,\} \quad (3)$$
The isolation trees in the n sub-forest anomaly detectors together constitute the base forest anomaly detector.
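In code, the systematic-sampling grouping of Equation 3 reduces to strided index selection; the sketch below uses 0-based indices for illustration.

def systematic_groups(L, n):
    """Group g receives trees g, g+n, g+2n, ... (Equation 3, 0-based)."""
    return [list(range(g, L, n)) for g in range(n)]

# e.g., 60 trees in 6 sub-forests: group 0 holds trees 0, 6, 12, ...
print(systematic_groups(60, 6)[0])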
The algorithm for building the base forest anomaly detector is as follows:
FundamentalIForest(Ψ, N, L)
Input: number of training samples per isolation tree, Ψ
size of the original sample set, N
number of isolation trees, L
Output: isolation forest (anomaly detector), IForest
1: Initialize IForest ← {}
2: h ← ceiling(log2 Ψ)
3: for i ← 1 to L do
4: X ← sampleWithReplacement(Ψ, N)
5: ITree ← ITreeTraining(X, h)
6: IForest ← IForest ∪ {ITree}
7: end for
8: return IForest
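For reference, an approximate runnable counterpart of this routine using scikit-learn's IsolationForest is sketched below; mapping Ψ to max_samples and grouping the trees by systematic sampling are our assumptions, not the authors' code.

import numpy as np
from sklearn.ensemble import IsolationForest

def fundamental_iforest(X_hist, L=60, psi=256, n_groups=6, seed=0):
    """Train L isolation trees on historical data and partition them into
    n_groups sub-forests via systematic sampling of tree indices."""
    forest = IsolationForest(n_estimators=L,
                             max_samples=min(psi, len(X_hist)),
                             random_state=seed).fit(X_hist)
    groups = [list(range(g, L, n_groups)) for g in range(n_groups)]
    return forest, groups

X_hist = np.random.rand(1000, 18)            # placeholder historical dataset
forest, groups = fundamental_iforest(X_hist)
scores = -forest.decision_function(X_hist)   # higher score = more anomalous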
Step 2: Use the base forest anomaly detector to determine the anomalies of the data entering the sliding window.
Specifically, the base forest anomaly detector is applied to the streaming data in the sliding window: the abnormality of each data point arriving at the window is judged by the base forest anomaly detector. The input is the real-time resource occupancy data of the processes related to the power dispatching system business, such as process CPU occupancy, memory occupancy, disk IO, network IO, number of threads, and number of network connections. The output is a value in the range (0, 1): values in (0, h) indicate that the streaming data is normal, and values in (h, 1) indicate that it is abnormal. The threshold h is the anomaly score obtained by applying the initially trained base forest detector to the historical data; it is taken as the quantile corresponding to the anomaly proportion of the historical data, as shown in Equation 4:

$$h = Q_{1-\rho}\bigl(\{\, s(x) \mid x \in X_{\mathrm{hist}} \,\}\bigr) \quad (4)$$

where $Q_{1-\rho}$ denotes the $(1-\rho)$-quantile, $s(x)$ is the anomaly score of sample $x$, $X_{\mathrm{hist}}$ is the historical dataset, and $\rho$ is the anomaly proportion of the historical data.
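A sketch of the quantile rule in Equation 4, assuming scores lie in (0, 1) and rho is the historical anomaly proportion:

import numpy as np

def anomaly_threshold(historical_scores, rho):
    """h is the (1 - rho)-quantile of historical anomaly scores, so that a
    fraction rho of historical data falls in the anomalous range (h, 1)."""
    return float(np.quantile(historical_scores, 1.0 - rho))

h = anomaly_threshold(np.random.rand(1000), rho=0.05)  # placeholder scores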
Step 3: Sample the stream data entering the sliding window and determine with a certain probability whether it is stored in the buffer; when the sliding window is full of data, determine the abnormality rate of the sliding window data at this time.
Specifically, for data that has just arrived in the sliding window, simple random sampling based on the Bernoulli distribution determines whether the data is also entered into the buffer, gradually filling it. If the sliding window is full, the newly arrived data replaces the data that entered the window earliest. At the same time, the anomaly rate of the data in the sliding window is calculated in real time and recorded as u′.
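A minimal sketch of this window and buffer maintenance; the buffer admission probability p_buffer is an illustrative parameter of our choosing.

import random
from collections import deque

window = deque(maxlen=64)       # oldest sample is dropped automatically
buffer = []

def observe(x, score, h, p_buffer=0.1):
    """Slide the window, Bernoulli-sample x into the buffer, and return
    the current window anomaly rate u' (fraction of scores above h)."""
    window.append(x)
    if random.random() < p_buffer:
        buffer.append(x)
    return sum(score(v) > h for v in window) / len(window)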
Step 4: When the amount of data in the buffer exceeds its threshold, the model update strategy is triggered with a smaller update ratio; when the anomaly rate of the sliding window data exceeds the specified threshold, the model update strategy is triggered with a larger update ratio.
Specifically, the anomaly detector is updated when either of the following two conditions is met:
1. The anomaly rate of the current sliding window data is greater than the anomaly rate threshold. At this time, the dataset X used to update the base forest anomaly detector is the union of the data in the sliding window and the data in the buffer;
2. The data in the buffer is full. At this time, the data set X used to update the base forest detector is the data in the buffer.
Step 5: Based on the updated dataset, calculate the difference between the anomaly rate of each sub-forest anomaly detector and that of the base forest anomaly detector, remove the sub-forest detectors with large differences, and construct multiple new sub-forest detectors to replace them, forming a new base forest anomaly detector and completing the update.
Specifically, the base forest detector and each sub-forest detector are used to calculate the anomaly rate on the updated dataset X*, denoted as u_all and u(i) (i = 1, …, n), respectively; the anomaly rate deviation of the ith sub-forest detector is r_i = |u(i) − u_all|.
Arrange the anomaly rate deviations of the n sub-forest anomaly detectors in descending order, and take the first k (0 < k < n) as the sub-forest anomaly detectors to be updated;
Use the data in the current sliding window to build k sub-forest anomaly detectors to replace the sub-forest anomaly detectors to be updated. At the same time, update the isolation tree in the base forest detector to complete the update of the base forest detector and clear the data in the sliding window and buffer.
The algorithm pseudocode for steps 2–5 is as follows:
updatingFundamentalIForest(X′, Xbuffer, u, u′, k, n_per, IForest)
Input: current window sample, X′
buffer sample, Xbuffer
anomaly rate threshold, u
anomaly rate of the current window, u′
number of sub-anomaly detectors to update, k
number of isolation trees in each group, n_per
current ensemble of isolation trees (anomaly detector) with m trees, IForest
Output: updated ensemble of isolation trees (anomaly detector), IForest′
1: Initialize IForest* ← {}, X* ← {}, R ← {}
2: if Xbuffer is full then
3: X* ← Xbuffer
4: end if
5: if u′ > u then
6: X* ← X′ ∪ Xbuffer
7: end if
8: for i ← 1 to k do
9: IForest*_i ← ITreeTraining(X*)
10: IForest* ← IForest* ∪ IForest*_i
11: end for
12: u_all ← getFailureRate(IForest, X*)
13: for i ← 1 to m/n_per do
14: IForest_i ← the ith group of isolation trees in IForest
15: u(i) ← getFailureRate(IForest_i, X*)
16: r_i ← |u(i) − u_all|
17: R ← R ∪ {r_i}
18: end for
19: Delete from IForest the k sub-anomaly detectors with the largest deviations in R
20: IForest′ ← IForest ∪ IForest*
21: return IForest′
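In Python, the deviation-based replacement of lines 12 to 20 could be sketched as follows; `anomaly_rate` and `train_subforest` stand for assumed interfaces of the sub-forest detectors, and we approximate the base forest's rate u_all by the mean of the sub-forest rates.

import numpy as np

def update_base_forest(subforests, X_star, train_subforest, k):
    """Drop the k sub-forests whose anomaly rate on X* deviates most from
    the ensemble's rate, and train k replacements on X*."""
    rates = np.array([sf.anomaly_rate(X_star) for sf in subforests])
    u_all = rates.mean()                      # ensemble anomaly rate
    deviations = np.abs(rates - u_all)        # r_i = |u(i) - u_all|
    keep = np.argsort(deviations)[:len(subforests) - k]
    survivors = [subforests[i] for i in keep]
    newcomers = [train_subforest(X_star) for _ in range(k)]
    return survivors + newcomers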
3.3 Anomaly detection algorithm for stream data in power dispatching and control systems based on self-supervised learning
The isolation forest-based anomaly detection method for power dispatching control system stream data presented in the previous section is fast and offers strong real-time performance, but it does not fully model the sequential information contained in the stream. At the same time, most stream data consists of negative samples (normal data), with only a small amount of positive samples (abnormal data); fully mining the knowledge of the transition from negative to positive samples helps enhance the robustness of the system. This section builds on the incremental isolation forest algorithm of the previous section and further introduces sequential information modeling into the unsupervised anomaly detection process. Specifically, we design a state memory sequence to record the stream data information preceding the current moment, and we enhance the model's detection performance using the historical stream information encoded in the state memory unit. We use a self-supervised learning framework based on GPT (Generative Pre-trained Transformer) [7] to pre-train the state memory unit and fully mine the information in the transition from negative to positive samples. In addition, to reduce consecutive, similar stream data, we propose a distance metric-based stream data sampling method that supports long-range memory, reduces storage overhead, and speeds up detection.
3.3.1 GPT-based streaming data self-supervised learning framework
Inspired by the recent rise of self-supervised language models in natural language processing, we introduce techniques from the generative language model (GPT) into the self-supervised learning of streaming data, fitting the current streaming data based on historical streaming data [8]. Through this data fitting, self-supervised learning can fully explore the intrinsic regularities of streaming data without manual annotation, achieving a good representation of historical stream information and improving the performance of the subsequent anomaly detection model.
Specifically, for a given stream data sequence $\{x_{t-N}, \dots, x_{t-1}, x_t\}$, the model is trained to maximize $\sum_{t}\log P\left(x_t \mid x_{t-N}, \dots, x_{t-1}; \Theta\right)$, where N represents the window size and Θ denotes the model parameters. The framework consists of three parts:

1. Input layer. Unlike text data, streaming data is in principle an infinitely long sequence and cannot be input into the GPT directly. Therefore, for the time t to be detected, only the N+1 samples $\{x_{t-N}, \dots, x_t\}$ are taken, and the first N samples are mapped to input vectors through an embedding layer with positional encoding.

2. GPT-based stream data feature representation. The stream data vector sequence obtained from the input layer is fed into a multi-layer Transformer decoder, where the output at each position encodes the historical stream information observed up to that position.

3. Probability estimation of the data to be predicted. Based on the historical stream data feature representation produced by the Transformer decoder, the probability of the stream data at the target time is calculated as shown in Equation 10. Substituting this probability into the language model objective function, the model is optimized according to the objective.
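A compact PyTorch sketch of such a decoder-only next-step predictor is shown below; we substitute a mean-squared-error next-step objective for the probabilistic objective of Equation 10, and all architecture sizes are illustrative rather than the authors' settings.

import torch
import torch.nn as nn

class StreamGPT(nn.Module):
    """Decoder-only Transformer that predicts x_t from x_{t-N}..x_{t-1}."""
    def __init__(self, dim=18, d_model=64, nhead=4, nlayers=2, N=64):
        super().__init__()
        self.embed = nn.Linear(dim, d_model)
        self.pos = nn.Parameter(torch.zeros(1, N, d_model))  # learned positions
        layer = nn.TransformerEncoderLayer(d_model, nhead, batch_first=True)
        self.blocks = nn.TransformerEncoder(layer, nlayers)
        self.head = nn.Linear(d_model, dim)

    def forward(self, x):                       # x: (batch, N, dim)
        mask = nn.Transformer.generate_square_subsequent_mask(x.size(1))
        h = self.blocks(self.embed(x) + self.pos[:, :x.size(1)], mask=mask)
        return self.head(h)                     # per-position next-step prediction

model = StreamGPT()
x = torch.randn(8, 64, 18)                      # batch of historical windows
pred = model(x)[:, :-1]                         # predictions for steps 2..N
loss = nn.functional.mse_loss(pred, x[:, 1:])   # next-step prediction error
loss.backward()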
3.3.2 Stream data sampling based on distance metric
We choose the L2 norm as the distance metric for sampling due to its computational efficiency and its ability to effectively capture Euclidean distances between data points. The L2 norm is commonly employed in anomaly detection tasks because it highlights the differences in the feature space, which is essential for distinguishing anomalous data in high-dimensional streaming environments.
Actual sampled stream data contains numerous consecutive, similar data points, resulting in a significant amount of redundant information. Processing all of these points would slow down model detection and introduce irrelevant information, hindering the model's ability to capture useful signals. To address this, we designed a distance metric-based stream data sampling method to reduce redundancy. Specifically, given the original stream data sequence, a newly arriving point is retained only if its L2 distance from the most recently retained point exceeds a threshold; otherwise it is discarded as redundant. The retained subsequence thus represents the original stream with far fewer points while preserving its significant changes.
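A sketch of this distance-based thinning; the threshold delta is a tunable parameter we introduce for illustration.

import numpy as np

def distance_sample(stream, delta):
    """Keep a point only if its L2 distance from the last kept point
    exceeds delta, thinning runs of near-identical samples."""
    kept = [stream[0]]
    for x in stream[1:]:
        if np.linalg.norm(x - kept[-1]) > delta:
            kept.append(x)
    return np.array(kept)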
3.3.3 Streaming data anomaly detection method based on self-supervised learning
Figure 7 provides an overview of the proposed self-supervised learning framework used for pre-training the GPT-based representation model.
The core idea is to leverage historical streaming data to learn temporal patterns without relying on manual labeling. To this end, the method constructs a state memory that stores a representative subset of the historical data. This memory is populated using a distance-based sampling strategy, ensuring that the stored sequences are diverse and preserve long-term temporal variation.
During pre-training, a sequence of N consecutive points drawn from the state memory is fed to the GPT-based model, which is trained to predict the next point; the prediction error drives the self-supervised objective, so no manual labels are required.
Once pre-training is completed, the learned parameters are frozen and deployed in the online detection pipeline. This separation between offline representation learning and online detection ensures both efficiency and stability, as shown in Figure 7. The self-supervised framework enables the model to generalize to unseen patterns and forms the foundation for robust anomaly detection in the streaming environment.
3.3.3.1 State memory unit construction
In order to introduce the stream data information preceding the current time step into the isolation forest-based anomaly detection algorithm of Section 3.2, the self-supervised learning framework described in Section 3.3.1 is used to construct a state memory unit based on the GPT network. The construction proceeds in three steps:

1. Data sampling: For each moment, first sample a stream data sequence of length N ending at that moment, using the distance metric-based sampling of Section 3.3.2, to form the pre-training dataset.

2. Pre-training: Based on the pre-training dataset, train the GPT network with the next-point prediction objective described in Section 3.3.1.

3. Time series feature calculation: After training is completed, for a given stream data point to be detected, feed the preceding sampled sequence into the trained network and take the decoder output as the temporal feature h_t.

The parameters of the state memory unit are frozen after pre-training and remain fixed during online detection.
3.3.3.2 Anomaly detection
Based on the trained state memory unit, the input of the isolation forest-based stream data anomaly detection method of Section 3.2 is augmented: the temporal feature h_t produced by the state memory unit is concatenated with the current raw feature vector before detection, injecting historical temporal information into the isolation forest detection process.
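A sketch of how the frozen state memory unit's output might be combined with the raw features before isolation forest scoring; `encode` is an assumed interface of the pre-trained model, not an API from the paper.

import numpy as np

def augmented_input(x_t, memory_model, history):
    """Concatenate the raw sample x_t with the temporal feature h_t produced
    by the frozen, pre-trained state memory unit (Sections 3.3.1-3.3.3)."""
    h_t = memory_model.encode(history)   # assumed interface returning h_t
    return np.concatenate([x_t, h_t])    # fed to the adaptive iForest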
4 Experimental results and comparative analysis
The dataset used in this study consists of process resource usage data collected from a computer running Python and other business processes. It includes 18 dimensions, mainly CPU usage (%), memory usage (MB), and IO read/write rate (MB/s). For the anomaly detection experiments, the dataset is divided into three subsets: a training set, a streaming update set, and a testing set. The training set contains 1,000 samples without anomaly labels for model initialization; the streaming update set contains 3,100 samples without anomaly labels to simulate stream data input and model updating; the testing set contains 2,472 samples with anomaly labels, among which 918 samples are labeled as anomalous. Anomalies are defined as the running Python process's resource usage exceeding 2 GB, while idle or non-computing periods are labeled as normal; this threshold is determined from the historical data distribution and empirical observation. The receiver operating characteristic (ROC) curve is often used to describe the performance of anomaly detection algorithms [10, 11]. An effective anomaly detection method must maintain both high recall and high precision, and the balance between the two can be described by the ROC curve. When evaluating an algorithm, a quantitative metric is often needed; this is provided by the area under the ROC curve (AUC).
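AUC can be computed directly from labels and scores, for example with scikit-learn; the arrays below are placeholders, not experimental data.

import numpy as np
from sklearn.metrics import roc_auc_score

y_true = np.random.randint(0, 2, 2472)   # placeholder 0/1 anomaly labels
y_score = np.random.rand(2472)           # placeholder anomaly scores
print(roc_auc_score(y_true, y_score))    # area under the ROC curve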
Table 1 summarizes the key statistics of the datasets used in the experiments. Each dataset contains 18 feature dimensions, including CPU usage, memory usage, and IO read/write rates. The “Sample Size” column indicates the total number of samples in each subset, and “Anomalous Samples” shows the number of samples labeled as anomalous (none in the training and streaming update sets, 918 in the testing set). The statistical columns (“Mean,” “Std Dev,” “Max,” “Min”) represent the average, standard deviation, maximum, and minimum values across all dimensions and all samples in the respective dataset. All features were normalized to the range [0,1] prior to input. This table provides readers with an overview of dataset composition, variability, and anomaly distribution.
To ensure consistency between model training and testing, all features are normalized to the range [0, 1] before input. The streaming data is input in its original temporal order, and sliding windows of 64 samples are used to generate input features, preserving the real-time character of the stream. The update strategy includes updating sub-detectors when the buffer is full and adjusting the update ratio when the estimated anomaly rate of the sliding window exceeds a preset threshold, addressing concept drift and clustered anomalies.
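A sketch of the preprocessing described above: min-max normalization fitted on the training set, and 64-sample sliding windows taken in temporal order (our illustration of the stated protocol).

import numpy as np

def fit_minmax(X_train):
    """Return a function mapping features into [0, 1] using training min/max."""
    lo, hi = X_train.min(axis=0), X_train.max(axis=0)
    scale = np.where(hi > lo, hi - lo, 1.0)      # guard constant features
    return lambda X: (X - lo) / scale

def sliding_windows(X, size=64):
    """Yield consecutive windows of `size` samples in temporal order."""
    for t in range(size, len(X) + 1):
        yield X[t - size:t]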
4.1 Selection of sampling size and integration scale
The ensemble size and the sliding sampling window size affect the algorithm's AUC performance, so an appropriate combination must be selected to ensure optimal AUC. The ensemble size is chosen from {20, 40, 60, 80, 100, 120}, and the sliding sampling window size from {64, 128, 256, 512, 1024}. The best values of these two parameters depend on the test data, so no isolation tree updates are performed in this selection experiment, meaning no update ratio needs to be set.
Since no isolation tree updates are performed, the algorithm of Ref. [9] behaves identically to ours in this experiment, so the same ensemble size and sliding window sampling size are selected for the subsequent experimental comparisons. The purpose is to control variables and then explore how the two algorithms behave as the update ratio changes. The dataset is input in sequence to simulate the characteristics of streaming data; the experimental results are shown in Table 2. The results show that for this dataset the algorithm achieves a higher AUC when the ensemble size is 60 and the sliding sampling window size is 64, so these two parameter values are used in subsequent experiments.
4.2 Algorithm comparative analysis
4.2.1 Performance comparison
The selection of the update ratio parameter requires consideration of the specific circumstances. Two scenarios trigger detector updates. First, an update is triggered by a full buffer; in this case, the estimated anomaly rate of the sliding window is below the preset threshold, so a large update ratio is not required. Second, an update is triggered when the estimated anomaly rate of the sliding window exceeds the threshold. This can happen in two ways: the data samples are normal but concept drift calls for a larger update, or the anomaly rate is high due to clustered anomalies while the detector is still usable, in which case the update ratio should be kept small.
In this paper, the update ratio is set below 0.5, and several discrete values are selected for simulation experiments: {0.1, 0.2, 0.3, 0.4}. Because of the Bernoulli randomness in data selection, and to obtain more objective evaluation results, the method proposed in this paper and the method of Ref. [9] were each run 20 times at every update ratio, and the average AUC was recorded as the experimental result, as shown in Table 3. This ensures that the performance improvements are statistically meaningful rather than artifacts of random fluctuation. Since stream data anomaly detection imposes real-time requirements, the detection speed of each method was compared using a protocol similar to the AUC test above; the detection speed results are shown in Tables 3, 4.
The proposed method outperforms traditional anomaly detection algorithms (e.g., Isolation Forest and Autoencoders) in several key aspects. First, our adaptive isolation forest mechanism addresses the performance degradation caused by concept drift, which is a significant issue for static algorithms. As shown in Table 3, our method demonstrates a 34.12% improvement in AUC compared to the baseline algorithm, especially under conditions of high concept drift (update ratio = 0.4). Second, the integration of self-supervised learning based on GPT allows our method to capture the temporal dependencies within the data, which traditional methods like Isolation Forest fail to do. This is particularly crucial in streaming data environments, where time-series patterns play a major role in anomaly detection. Our results show a significant increase in the detection performance, with AUC values improving by up to 39.12% compared to existing methods.
4.2.2 Ablation experiments
An ablation study removes individual improvements from a model or algorithm to verify their necessity. This paper presents an adaptive isolation forest algorithm and a self-supervised learning algorithm for anomaly detection in stream data of a power dispatching control system, and ablation experiments were conducted to verify the effectiveness of each submodule's improvements. Each set of ablation experiments was conducted at different update ratios. The results are shown in Table 5.
The following ablations are performed on the adaptive isolation forest algorithm proposed in Section 3.2:
1. Adaptive: Instead of using incremental learning, all sub-anomaly detectors are learned using only the data in the current cache. All sub-anomaly detectors are replaced each time the model is updated.
2. Forest: Only one sub-anomaly detector is used without integrating multiple sub-anomaly detectors;
For the anomaly detection algorithm based on self-supervised learning proposed in Section 3.3, the following ablations are adopted:
1. Self-supervised learning: Instead of using the self-supervised learning state unit to encode the historical stream data information into ht, we simply concatenate the sampled historical stream data vectors to form ht.
2. Distance-based sampling: During pre-training and testing, no distance-based sampling is performed to process the input data of the state memory unit.
The t-test results indicate that the improvements introduced by the “Adaptive” and “Self-supervision” modules are statistically significant. This confirms that each module contributes meaningfully to the overall performance and that the improvements are not due to random variation. The results are shown in Table 6.
4.3 Experimental results analysis
Analysis of the experimental results shows that the two methods behave differently under different update ratios. Although the literature offers several other anomaly detection methods, we compare our method with that of Ref. [9] because it is the most relevant to our approach and operates under similar conditions; methods such as autoencoders and GANs, while promising, were not included in this comparison, and we plan to include them in future work for a more comprehensive evaluation. Compared with the method of Ref. [9], the AUC of the proposed method is significantly improved at update ratios of 0.1 and 0.4 and is comparable at 0.2 and 0.3. In addition, at an update ratio of 0.4, the AUC values of the two methods proposed in Sections 3.2 and 3.3 of this paper are improved by 34.27% and 39.12%, respectively, over the method of Ref. [9]. The performance comparison experiments show that the self-supervised learning-based stream data anomaly detection algorithm proposed in this paper has better overall performance.
In terms of detection speed, the adaptive isolation forest algorithm proposed in Section 3.2 of this paper is comparable to the baseline algorithm. However, the self-supervised learning-based approach proposed in Section 3.3 is relatively slow. This is primarily due to the significant computational time required by the neural network used in this approach during forward computation. Therefore, in practical applications, this approach places higher demands on the real-time performance of other system components. Considering the actual margin required for real-time performance of streaming data in power dispatching and control systems, the speed disadvantage of the self-supervised learning-based approach remains acceptable.
The ablation experiment results based on the adaptive isolation forest algorithm proposed in Section 3.2 of this paper show that adaptively updating the model is of great significance to the proposed method. Without the proposed adaptive update mechanism, the model detection performance degrades significantly. At the same time, ensemble learning plays a fundamental role in anomaly detection. Compared with the adaptive update mechanism, the performance degradation is more obvious when the ensemble learning algorithm is not used.
The ablation experiments based on the self-supervised learning algorithm presented in Section 3.3 of this paper show that the temporal features obtained by self-supervised learning not only improve the model’s detection performance but also provide more stable performance under different update rates. Furthermore, ablation experiments comparing “self-supervised learning” with “adaptive isolation forest” demonstrate that temporal information is crucial for anomaly detection in streaming data, improving detection performance even without self-supervised learning. Distance-based sampling also significantly improves the modeling of temporal features and patterns in streaming data by removing redundant information from the data stream, specifically sampling points that have not changed significantly from the previous moment.
5 Conclusion
To address the practical needs of anomaly detection in power grid dispatching services, this paper proposes an adaptive isolation forest-based anomaly detection algorithm for streaming data in power dispatching and control systems, and further introduces a self-supervised learning framework to improve model performance. Taking into account the characteristics of streaming data in power dispatching and control systems, a new isolation forest anomaly detector update strategy is proposed: sub-forest anomaly detectors with large anomaly rate deviations are discarded and replaced with newly built sub-forest detectors. This addresses the overall performance degradation of anomaly detectors caused by random updates and improves the algorithm's anomaly detection performance. The self-supervised learning framework employs a deep neural network and training techniques similar to the GPT pre-trained language model to model the temporal characteristics and patterns of streaming data, further improving detection performance. Training and testing on a simulated streaming dataset with temporal characteristics demonstrate the proposed method's superior comprehensive anomaly detection performance, including recall and precision, as well as the effectiveness of its key technical features. Given that abnormal samples are currently too few and the anomaly types insufficiently rich, future work will build on an in-depth analysis of the business-related processes and their topological relationships in the power dispatching system, further improving the comprehensive performance of the anomaly detection method and expanding its applicability through continued data collection and accumulation.
Data availability statement
The original contributions presented in the study are included in the article/supplementary material, further inquiries can be directed to the corresponding author.
Author contributions
TX: Investigation, Writing – original draft, Funding acquisition, Methodology, Project administration, Supervision. BL: Investigation, Writing – review and editing, Methodology. YZ: Data curation, Investigation, Writing – review and editing. HW: Methodology, Supervision, Writing – review and editing. KX: Formal Analysis, Methodology, Writing – review and editing. JH: Formal Analysis, Project administration, Writing – review and editing.
Funding
The authors declare that no financial support was received for the research and/or publication of this article.
Conflict of interest
Authors TX, BL, YZ, HW, KX, and JH were employed by Information Center of Guangdong Power Grid Co., Ltd.
Generative AI statement
The authors declare that no Generative AI was used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
1. Liu FT, Ting KM, Zhou ZH. Isolation forest. In: Data mining and knowledge discovery. Springer US (2019). p. 1–36.
2. Hariri S, Kind MC, Brunner RJ. Extended isolation forest. IEEE Trans Knowledge Data Eng (2021) 33(4):1479–89. doi:10.1109/tkde.2019.2947676
3. Chen Y, Zhou L, Pei S, Yu Z, Chen Y, Liu X, et al. SRF: self-supervised random forest for anomaly detection in streaming data. IEEE Trans Neural Networks Learn Syst (2022).
4. Li Z, Zhao Y, Hu Q, He N, Yao L. EIF: an extended isolation forest for anomaly detection in big data. In: 2020 international joint conference on neural networks (IJCNN). IEEE (2020). p. 1–8.
5. Doshi K, Yilmaz Y. Continual learning for anomaly detection in streaming data. Eng Appl Artif Intelligence (2023) 123:106439. doi:10.1109/ACCESS.2024.3377690
6. Lu J, Zhang W, Hamzei M, Jafari N. The applications of machine learning mechanisms in the compositions of internet of things services: a systematic study, current progress, and future research agenda. Eng Appl Artif Intelligence (2025) 147:110345. doi:10.1016/j.engappai.2025.110345
7. Zhang H, Song Y, Wang Y. A self-supervised learning framework for anomaly detection in industrial IoT. IEEE Internet Things J (2021) 8(15):12345–56.
8. Gu X, Akoglu L, Rinaldo A. Statistical analysis of anomaly detection in attributed streams. In: Proceedings of the 25th ACM SIGKDD international conference on knowledge discovery and data mining (2019). p. 1897–907.
9. Wang H, Bai J, Wang J, Wang Z. An adaptive ensemble method for anomaly detection in power grid streaming data. IEEE Trans Power Syst (2022) 37(2):1542–53.
10. Pang G, Shen C, Cao L, van den Hengel A. Deep learning for anomaly detection: a review. ACM Comput Surv (Csur) (2021) 54(2):1–38. doi:10.1145/3439950
11. Zhao Y, Nasrullah Z, Li Z. PyOD: a python toolbox for scalable outlier detection. J Machine Learn Res (2019) 20(96):1–7.
Keywords: adaptive isolation forest, self-supervised learning, power dispatch system, streaming data, anomaly detection
Citation: Xie T, Li B, Zhen Y, Wei H, Xu K and Huang J (2026) Anomaly detection method for power dispatch streaming data based on adaptive isolation forest and self-supervised learning. Front. Phys. 13:1704495. doi: 10.3389/fphy.2025.1704495
Received: 13 September 2025; Accepted: 18 November 2025;
Published: 06 January 2026.
Edited by: Yuanyuan Huang, Chengdu University of Information Technology, China

Reviewed by: Amin Ul Haq, University of Electronic Science and Technology of China, China; Ashwini Rao, SVKM’s Narsee Monjee Institute of Management Studies, India
Copyright © 2026 Xie, Li, Zhen, Wei, Xu and Huang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Tenglong Xie, dsqyy124@163.com
Bo Li