Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Big Data

Sec. Machine Learning and Artificial Intelligence

Volume 8 - 2025 | doi: 10.3389/fdata.2025.1705587

This article is part of the Research TopicAI-Driven Architectures and Algorithms for Secure and Scalable Big Data SystemsView all 4 articles

PHTFNet-RPM: a probabilistic hybrid network with RPM for tobacco root disease forecasting

Provisionally accepted
Yunhong  BuYunhong Bu1Tingshan  YaoTingshan Yao2Shaowu  GengShaowu Geng1Renjie  HUANGRenjie HUANG3*
  • 1Chuxiong Company of Yunnan Provincial Tobacco Corporation, Chuxiong, China
  • 2National Engineering Research Center for Citrus Technology, Citrus Research Institute, Southwest University, Chongqing, China
  • 3College of Computer and Information Science, Southwest University, Chuxiong, China

The final, formatted version of the article will be published soon.

Tobacco growers usually face particular challenges in predicting risks of tobacco root diseases due to complex pathogenesis, concealed early symptoms and heterogeneous farm conditions. To solve this problem, we proposed a flexible Probatilistic Hybrid Temporal Fusion Network with Random Period Mask (PHTFNet-RPM) to forecast the disease incidences and disease indices in future multiple days. It allows for a hybrid-structured input with RPM, incorporating configurable static management variables and time-series variables of weather factors and disease metrics, in which RPM simulates the diverse absences of historical observations. Its internal hierarchically aggregated modules learn cross-variable and cross-temporal feature representations to model the complex nonlinear relation between independent variables and target disease metrics. Especially, the probabilistic theory-based uncertainty quantifications of model and prediction results are designed to improve model's credibility and reliability. To verify the proposed PHTFNet-RPM, a large-scale time-series dataset of tobacco root diseases is constructed by organizing 20-year meteorological and disease survey records from Chuxiong Prefecture, Yunnan Province. Extensive comparative experiments demonstrate that our model achieves 4.44–16.43% lower mean absolute error (MAE) than the existing models (LR, SVR, CNN-LSTM, LSTM-Attention etc.), and it can forecast the reliable disease progression trends under different configurations, even only relying on the historical weather observations.

Keywords: Hybrid neural network, random period mask, Uncertainty estimate, Plant disease forecasting, Time-series modeling, smartagriculture

Received: 15 Sep 2025; Accepted: 20 Oct 2025.

Copyright: © 2025 Bu, Yao, Geng and HUANG. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Renjie HUANG, huangrj@swu.edu.cn

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.