- 1Department of Agricultural Science, Food, Natural Resources and Engineering, University of Foggia, Foggia, Italy
- 2Department of Information Science and Technology, Pegaso University, Naples, Italy
- 3Department of Engineering, University of Sannio, Benevento, Italy
Predicting bug fix times is a key objective for improving software maintenance and supporting planning in open source projects. In this study, we evaluate the effectiveness of different time series forecasting models applied to real-world data from multiple repositories, comparing local (one model per project) and global (a single model trained across multiple projects) approaches. We considered classical models (Naive, Linear Regression, Random Forest) and neural networks (MLP, LSTM, GRU), with global extensions including Random Forest and LSTM with project embeddings. The results highlight that, at the local level, Random Forest achieves lower errors and better classification metrics than deep learning models in several cases. However, global models show greater robustness and generalizability: in particular, the global Random Forest significantly reduces the mean error and maintains high performance in terms of accuracy and F1 score, while the global LSTM captures temporal dependencies and provides additional insights into cross-project dynamics. The explainable AI techniques adopted (permutation importance, saliency maps, and embedding analysis) allow us to interpret the main drivers of forecasts, confirming the role of process variables and temporal characteristics. Overall, the study demonstrates that an integrated approach, combining classical models and deep learning in a global perspective, offers more reliable and interpretable forecasts to support software maintenance.
1 Introduction
Software bugs are defects or anomalies that compromise the proper functioning of a program, causing malfunctions, unexpected behavior, or security vulnerabilities (Pressman, 2010). Their presence is inevitable in development and maintenance processes, especially in large-scale projects and open source contexts, where code complexity and the multiplicity of contributions increase the likelihood of introducing errors (Ferraro et al., 2023). The ability to quickly detect and resolve bugs is therefore crucial not only to ensure the reliability and quality of the software, but also to reduce maintenance costs and improve the end-user experience (Weiss et al., 2022; Raharja et al., 2023).
A particularly important aspect of bug management is estimating the time required to resolve them. Knowing the average resolution time in advance allows for better resource planning, optimized development team allocation, and reduced release uncertainty (Ozkan et al., 2024). In open source projects, where collaboration is often distributed and voluntary, the ability to predict resolution times becomes even more important to maintain a smooth development flow and ensure the project's long-term sustainability.
In this context, time series forecasting techniques supported by machine learning offer promising tools (Noyori et al., 2023). Predictive models not only allow for more precise estimates of future resolution times, but also for identifying recurring patterns, seasonality, and differences between projects. The introduction of deep learning methodologies allows for capturing more complex temporal dependencies, while the integration of explainable artificial intelligence (XAI) techniques promotes greater forecast transparency, making them more interpretable and reliable for developers (Kim et al., 2025; Rojat et al., 2021).
This work aims to explore these possibilities by developing a comparative framework that evaluates classical and deep models in local and global scenarios. From the local perspective, each project is considered as an independent series, for which dedicated models are trained; from the global perspective, on the other hand, information from multiple projects is integrated into a single model capable of learning shared patterns. We examined reference models such as the naïve baseline and linear regression, more advanced approaches such as Random Forest and feed-forward neural networks (MLP), and sequential models based on LSTM and GRU, with global extensions that also incorporate specific embeddings for the project variable.
The work has been developed using the public GitBugs dataset, which collects bug reports from the maintenance processes of various open-source projects. The preprocessed and aggregated data was further enriched with process variables and temporal characteristics to build a set of features suitable for training predictive models. Starting from this dataset, the models were trained and evaluated using regression and classification metrics, providing a comprehensive assessment of their performance in terms of both numerical accuracy and discriminative ability. Finally, the resulting predictions were interpreted using explainable AI methodologies, such as permutation importance, saliency maps, and project embedding analysis, to highlight the most relevant variables and ensure transparency in the results. Thus, the paper aims not only to compare the performance of different models in forecasting bug resolution times, but also to demonstrate the value of an integrated approach that combines classical and deep learning models with explainability techniques to support more informed decisions in software maintenance.
The remainder of the paper is organized as follows. The next section presents related works, providing an overview of existing studies on bug resolution time prediction and the application of machine learning and deep learning models in this field. Section 3 details the dataset used, the preprocessing steps, the implemented models, and the evaluation protocol adopted. Subsequently, Section 4 reports the performance achieved by the local and global models, highlighting the differences between classical and deep learning approaches and discussing the findings from the explainability analyses. Finally, Section 5 summarizes the main contributions of the work, highlights its limitations, and suggests possible directions for future studies.
2 Related works
Research on bug prediction has developed along several lines, ranging from bug/defect prediction, to issue/bug resolution time prediction, to time-series forecasting of maintenance indicators and integration with explainable AI techniques. This work straddles these lines: it uses the public GitBugs dataset (Patil, 2025a; Patil et al., 2025), formulates the problem as a weekly time-series forecast of the average resolution time, compares classical and deep models from both a local (per-project) and global (cross-project) perspective, and provides explainability through permutation importance, saliency, and project embedding analysis.
Historically, the defect prediction literature has explored within-project and especially cross-project defect prediction (CPDP) scenarios, where knowledge is transferred between projects with different distributions (Zimmermann et al., 2009; Turhan et al., 2009). Domain adaptation techniques and more recently federated approaches have been shown to improve generalization capability (Javed et al., 2024; Wang et al., 2024), confirming the value of “global” models that learn from multiple sources and incorporate domain information to reduce distribution shift.
In parallel, the specific thread on resolution time prediction has proposed both "classic" approaches–from single-bug fast/slow classification (Bhattacharya and Neamtiu, 2011; Zhang and Kim, 2013) to analyses on industrial data–and recent approaches that integrate text, metadata, and process characteristics. The recurring evidence is that combining structured signals (priority, backlog, team load, project dynamics) with temporal features yields more robust estimates than using text alone. Recent interest also extends to LLM-based models that quickly distinguish long from short fix times (Nastos and Chatzigeorgiou, 2025a), useful for triage, but not always accompanied by evaluations over multi-step or multi-project time horizons. Furthermore, studies aimed at explainability confirm the importance of providing transparency in resolution time estimates (Nastos and Chatzigeorgiou, 2025b; De Luca et al., 2023).
On the methodological side, deep learning surveys for time series offer a framework for choosing between classical models and sequential architectures (RNN/LSTM/GRU, Transformers), highlighting open questions on generalization, temporal validation, and explainability. The adoption of time series-specific XAI tools (e.g., saliency maps, perturbation/permutation) is considered crucial to make the models usable in real-world contexts (Liu et al., 2024; Lim and Zohren, 2021; Rojat et al., 2021). Our joint use of permutation importance (for tree models) and gradient-based saliency (for sequential models) follows these recommendations.
Given these methodological considerations, it is essential to have large and diverse datasets that allow us to compare classical and deep approaches in realistic scenarios. This is where the use of GitBugs comes in. This public dataset enables various lines of analysis on bug reports and allows us to systematically test the effectiveness of local and global models.
The GitBugs dataset was introduced to enable various bug report analysis tasks–duplicate detection, triage, RAG, resolution prediction, and temporal analysis (Patil, 2025a,b; Patil et al., 2025). Work related to GitBugs has primarily focused on textual tasks (similarity, duplicates, triage) and the evaluation of NLP/embedding models; the “resolution prediction” component is mentioned as a use case enabled by the dataset, but to date there has been no systematic comparison between classical and deep models from a time-series perspective, nor a local vs. global evaluation of the weekly forecast of the average resolution time. This is where our contribution lies.
Unlike per-issue or class-based fix time studies, we explicitly model a weekly time series of the average resolution time per project, thus capturing seasonality and dependencies in the maintenance process rather than limiting ourselves to a binary classification. In line with CPDP evidence, we propose and evaluate global models (Random Forest with project one-hots and LSTM with project embeddings) that learn shared cross-project patterns, comparing them with their local counterparts. Such a systematic comparison of forecast bug-resolution times at the multi-project level is rare in existing work. Finally, we provide explainability consistent with the nature of the models: permutation importance for tree ensembles and saliency/embedding inspection for sequential models, following the XAI time series guidelines (Ozkan and Tosun, 2024).
3 Methodology
The methodology adopted in this work was designed to systematically address the problem of forecasting the mean time to resolve bugs, while ensuring transparency and reproducibility. The section is organized as follows. The Dataset subsection describes the data corpus used, with particular reference to the GitBugs dataset and its characteristics. The Preprocessing Step subsection illustrates the data cleaning, transformation, and aggregation operations required to convert the raw reports into weekly time series suitable for forecasting. The Temporal Feature Engineering subsection details the lagged, rolling, and seasonal features added to the series. The Models subsection presents the algorithms adopted, including both classical models and deep learning architectures, in local and global configurations, while the Experimental Setup subsection specifies their hyperparameters and training configuration. The Validation subsection describes the metrics, evaluation procedures, and comparison criteria used to ensure a systematic performance analysis. Finally, the Explainability subsection introduces the interpretability techniques applied to the models, with the aim of making decisions more transparent and providing useful insights into the role of variables and the learned temporal dynamics.
3.1 Dataset
For the experimental evaluation, we used the GitBugs dataset (Patil, 2025a,b; Patil et al., 2025), a public resource that collects bug reports from various open-source projects hosted on GitHub. The dataset was designed to enable various analysis tasks, including duplicate detection, triage, retrieval augmented generation (RAG), resolution time prediction, and temporal analysis. The corpus contains bug reports extracted from numerous popular software systems, including Apache Spark, Kubernetes, Cassandra, Firefox, Hadoop, HBase, Thunderbird, Mozilla Core, Seamonkey, and VSCode.
Table 1 shows the distribution of bug reports across projects included in GitBugs. Significant heterogeneity is observed in terms of dataset size: projects like Mozilla Core and VS Code have tens of thousands of reports, while systems like Seamonkey or Hadoop contribute much smaller volumes. This variability makes the dataset particularly interesting for comparative analyses, as it allows us to evaluate the performance of models both in contexts with a high density of reports and in rarer, sparsely populated scenarios. Furthermore, the presence of different domains (distributed databases, big data frameworks, browsers, IDEs, email clients) provides a variety that allows us to test the generalization ability of forecasting models.
Each bug report in the dataset is described by a rich set of features. In addition to textual attributes (title, description, comments), metadata related to bug management is available, including:
• Identifier of the bug and the source project.
• Timestamps of the bug's creation and closure, which allow us to calculate the resolution time.
• Associated labels and categories (priority, severity, type).
• Information about the developer or team involved in the resolution.
• Bug status (open, closed, in progress).
To better contextualize the data characteristics and the complexity of the forecasting task, a brief exploratory analysis of the weekly time series was also conducted. This analysis highlights some recurring properties across the different projects, useful for understanding the behavior of resolution time and justifying the methodological choices adopted in the following sections. A preliminary inspection of the weekly time series of the projects included in the dataset reveals strong heterogeneity in the evolutionary dynamics of resolution time. Some projects exhibit clearer trends (increasing or decreasing), others exhibit regular fluctuations due to seasonal components, while in several cases, irregular fluctuations emerge associated with sudden increases in the backlog or changes in the flow of reports. The combination of high-frequency noise, medium-term trends, and possible cyclicalities makes the forecasting problem particularly challenging, justifying the adoption of dedicated temporal transformations and models capable of capturing nonlinear dynamics and long-term dependencies.
3.2 Preprocessing step
The goal of preprocessing was to transform the raw bug reports from the dataset into weekly time series suitable for forecasting the average resolution time. First, the temporal information was normalized and aligned: the creation and closure dates of bugs, sourced from heterogeneous formats and trackers, were made consistent into a single time reference. From this, the resolution time was calculated for each resolved issue; unresolved cases were excluded from the duration statistics.
The data was then aggregated on a weekly basis (consistent and non-overlapping weeks), producing summary indicators of the maintenance process for each week. Specifically, the target series was the average weekly resolution time (along with the median and 90th percentile to capture the tail of the distribution), while the exogenous signals were the open and closed bug counts per week, with priority normalization into three levels (high, medium, low) and the corresponding percentage shares. To represent load dynamics over time, a cumulative backlog was also constructed, obtained by accumulating the difference between weekly openings and closings.
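To make this step concrete, the following is a minimal pandas sketch of the weekly aggregation and backlog construction under the assumptions stated above; the column names (created_at, closed_at, and so on) are illustrative placeholders, not the actual GitBugs field names.

```python
import pandas as pd

# Illustrative sketch; column names are placeholders, not GitBugs field names.
df = pd.read_csv("project_bugs.csv", parse_dates=["created_at", "closed_at"])

# Resolution time per resolved issue; unresolved reports carry NaT and drop out.
df["resolution_hours"] = (df["closed_at"] - df["created_at"]).dt.total_seconds() / 3600

opened = df.set_index("created_at").resample("W").size().rename("opened")
closed = df.dropna(subset=["closed_at"]).set_index("closed_at").resample("W")

weekly = pd.concat([
    opened,
    closed.size().rename("closed"),
    closed["resolution_hours"].mean().rename("mean_resolution_hours"),
    closed["resolution_hours"].median().rename("median_resolution_hours"),
    closed["resolution_hours"].quantile(0.9).rename("p90_resolution_hours"),
], axis=1)
weekly[["opened", "closed"]] = weekly[["opened", "closed"]].fillna(0)

# Cumulative backlog: accumulated difference between weekly openings and closings.
weekly["backlog"] = (weekly["opened"] - weekly["closed"]).cumsum()
```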
To facilitate the learning of seasonal patterns and process inertia, the series was enriched with temporal characteristics (year, ISO week, month, seasonal sinusoidal components) and with lagged versions of the target variable (1, 2, and 4 weeks) as well as rolling statistics (mean and standard deviation) computed over 4-, 8-, and 12-week windows. These transformations, applied in a causal manner (using only past information), do not alter the observed data, but provide the model with synthetic signals about persistence, fluctuations, and seasonality of the resolution time.
The entire procedure was applied consistently to each project, producing homogeneous weekly series that were then consolidated into a single multi-project dataset. This framework allows for both local (one model per project) and global (a single model that learns regularities shared across projects) analyses, facilitating systematic comparisons between model families and generalization scenarios.
During the pre-processing phase, all bug reports without a closed date were excluded, as calculating resolution times necessarily requires both the creation and closing dates. In the GitBugs dataset, this resulted in the elimination of a significant number of unresolved reports, with considerable variability across projects depending on activity intensity, maintenance policies, and backlog size. This removal resulted in an overall reduction of approximately 3.6% in reports usable for calculating resolution times. Excluding these reports can introduce biases in the construction of the time series. In particular, bugs that remain open for long periods often tend to be more complex or receive a lower priority; therefore, their removal can lead to an underestimation of the average weekly resolution time. Similarly, periods characterized by slowdowns or backlog accumulation can be mitigated, as bugs opened during these phases do not contribute to the time statistics. To mitigate these effects, operational indicators such as weekly open and closed bugs and cumulative backlog were retained in the dataset, which still allows us to indirectly capture maintenance process pressures even in the presence of unresolved issues.
3.3 Temporal feature engineering
To explicitly model the temporal dimension of the bug fixing process, the weekly series was enriched with a set of engineered features, designed to provide the model with memory, trend, and seasonality signals. All transformations were computed in causal mode, that is, using only the information available up to week t, avoiding any form of leakage.
Lag features: let $y_t$ be the average resolution time observed at week $t$. To capture the short-term inertia effect, lagged versions of the target variable were introduced:

$$\text{lag}_k(t) = y_{t-k}, \qquad k \in \{1, 2, 4\}.$$
These features allow models to exploit the persistence of the maintenance process, where delays or accelerations tend to propagate over the following weeks.
Rolling statistics: to represent the trend and stability over longer time horizons, the moving average and moving standard deviation of the target variable were calculated:

$$\bar{y}_t^{(w)} = \frac{1}{w}\sum_{i=1}^{w} y_{t-i}, \qquad s_t^{(w)} = \sqrt{\frac{1}{w-1}\sum_{i=1}^{w}\left(y_{t-i} - \bar{y}_t^{(w)}\right)^2},$$

with $w \in \{4, 8, 12\}$ week windows. Moving averages summarize recent trends, while moving standard deviations highlight periods of greater or lesser variability in the maintenance process.
Seasonal signals: because software maintenance processes can exhibit annual cyclicity, the ISO week of the year $w_t$ has been encoded using sinusoidal components:

$$\text{sin\_week}_t = \sin\!\left(\frac{2\pi w_t}{52}\right), \qquad \text{cos\_week}_t = \cos\!\left(\frac{2\pi w_t}{52}\right).$$
This continuous representation eliminates the discontinuity between week 52 and week 1, allowing the models to learn seasonal patterns more seamlessly.
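As a compact illustration (not the original pipeline), the following sketch derives the three groups of features from the weekly frame built in the previous step; shifting the series before computing rolling statistics keeps every transformation causal.

```python
import numpy as np

# Causal feature engineering on the weekly frame built above; every transform
# uses only information available up to week t.
y = weekly["mean_resolution_hours"]

for k in (1, 2, 4):                       # lag features y_{t-k}
    weekly[f"lag{k}"] = y.shift(k)

for w in (4, 8, 12):                      # rolling statistics over the past w weeks
    past = y.shift(1)                     # shift first, so week t is excluded
    weekly[f"roll_mean_{w}"] = past.rolling(w).mean()
    weekly[f"roll_std_{w}"] = past.rolling(w).std()

# Continuous seasonal encoding of the ISO week (no jump between weeks 52 and 1).
iso_week = weekly.index.isocalendar().week.astype(float)
weekly["sin_week"] = np.sin(2 * np.pi * iso_week / 52.0)
weekly["cos_week"] = np.cos(2 * np.pi * iso_week / 52.0)
```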
Overall, these temporal transformations complement the original weekly indicators by providing synthetic signals about the persistence, medium-term fluctuations, and seasonal cyclicality of bug fix time. In combination, they strengthen the ability of both classical models and sequential neural networks to capture the typical dynamics of the maintenance process.
3.4 Models
This study compared models from three main families: (i) a naïve baseline model, (ii) classical regression and ensemble models, and (iii) sequential deep learning models, in local (per-project) and global (cross-project) configurations. Below, we briefly describe each class, the rationale behind their choice, and the literature surrounding them.
3.4.1 Naïve baseline
The naïve model, frequently used as a benchmark in forecasting studies, simply assumes that the future prediction coincides with the last observed value of the target (persistence). It is not sophisticated, but serves as a benchmark to assess whether more complex methods add real value, as is commonly the case in forecasting competitions (M-Competitions highlight that complex methods do not always win over simple baselines) (Makridakis et al., 2020).
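In code, persistence amounts to a one-step shift of the target series; a minimal sketch, assuming the weekly frame defined earlier:

```python
# Naive/persistence baseline: next week's forecast is this week's observation.
naive_forecast = weekly["mean_resolution_hours"].shift(1)
```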
3.4.2 Classical models: linear regression and random forest
Among traditional methods, we included linear regression to estimate linear relationships between features and mean resolution time. However, the complex and nonlinear nature of the software maintenance process suggests that more flexible models may be more effective. For this reason, our primary approach in the classical class is Random Forest Regressor. Tree ensembles (forests) are widely used in software analytics scenarios (bug prediction, defect fix time) due to their ability to capture nonlinear relationships without requiring intensive preprocessing and their interpretability (feature importance) through techniques such as permutation importance. In comparative studies between ML and DL models, Random Forest is often a benchmark competitor and in many cases provides robust performance (although it does not always dominate in the presence of deep temporal dependencies) (Ghotra et al., 2015).
3.4.3 Sequential deep learning models: LSTM
To capture the temporal dependencies (inertia, cyclicity, long-term memory) implicit in the bug-fixing process, we adopted models based on Long Short-Term Memory (LSTM), a variant of RNN particularly suited for sequences with extensive dependencies (Hochreiter and Schmidhuber, 1997). LSTMs are widely used in real-world time series forecasting because they mitigate the vanishing gradient problem and allow learning longitudinal patterns. In distributed forecasting contexts across multiple series, some recent proposals suggest training “global” LSTM networks that learn from multiple series simultaneously, capturing patterns shared across projects (Bandara et al., 2020). In our global setting, the LSTM network is enriched with project embeddings (i.e., a numerical representation of the project itself) that allow the model to adapt the forecast to the specific context of the project, while still benefiting from experience learned from other projects. This strategy is similar to cross-series techniques that seek to exploit similarity between series to improve generalization (e.g., clustering of series by groups) (Montero-Manso et al., 2020).
3.5 Experimental setup
All experiments were conducted using a consistent and reproducible configuration, without systematic hyperparameter tuning. This choice was driven by the desire to ensure a fair comparison between local and global models, prioritizing stability and comparability over aggressive performance optimization. The hyperparameters were therefore set to standard values commonly used in the literature on time series forecasting and software maintenance process analysis.

For the classical models, linear regression was used in its conventional form, while the naïve model adopted the persistence approach, assuming that the future value coincides with the last observed value. The Random Forest, which represents the primary nonlinear baseline, was configured with a large number of trees (400 in the local configuration and 600 in the global one), keeping all other parameters at the default values provided by scikit-learn. This choice is consistent with numerous works in defect prediction and resolution time modeling, where this configuration has proven robust even without hyperparameter optimization.

For deep learning, an LSTM architecture was adopted to capture the sequential structure of the data, employing a 12-week input window and a 64-unit recurrent layer, followed by a 32-unit dense intermediate layer. Training was performed with the Adam optimizer at a learning rate of $10^{-3}$, a batch size of 32, and a maximum of 100 epochs, with early stopping after ten epochs without improvement. In the global setup, project information was incorporated via an eight-dimensional embedding, concatenated to the time sequence to allow the model to learn patterns shared across projects while maintaining local specificities.

The validation phase followed a strictly temporal strategy. For each project, the data set was divided into three consecutive segments: an initial portion used for training, a subsequent twenty-four-week window for validation, and the final twenty-four weeks for testing. In the global configuration, these segments were constructed separately for each project and subsequently concatenated, ensuring temporal causality and the ability to compare performance both at the individual and aggregate levels.
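A Keras sketch of the global LSTM consistent with the hyperparameters stated above; the exact wiring of the project embedding into the sequence is our reconstruction, and N_FEATURES and N_PROJECTS are placeholders rather than the actual dataset dimensions.

```python
import tensorflow as tf
from tensorflow.keras import layers

WINDOW, EMB_DIM = 12, 8            # 12-week window, dimension-eight embedding
N_FEATURES, N_PROJECTS = 20, 10    # placeholders, not the actual dataset values

seq_in = layers.Input(shape=(WINDOW, N_FEATURES), name="sequence")
proj_in = layers.Input(shape=(1,), dtype="int32", name="project_id")

# Project embedding, repeated along the time axis and appended to the sequence.
emb = layers.Flatten()(layers.Embedding(N_PROJECTS, EMB_DIM)(proj_in))
x = layers.Concatenate(axis=-1)([seq_in, layers.RepeatVector(WINDOW)(emb)])

x = layers.LSTM(64)(x)                     # 64-unit recurrent layer
x = layers.Dense(32, activation="relu")(x) # 32-unit dense intermediate layer
out = layers.Dense(1)(x)                   # weekly mean resolution time

model = tf.keras.Model([seq_in, proj_in], out)
model.compile(optimizer=tf.keras.optimizers.Adam(learning_rate=1e-3), loss="mae")
stop = tf.keras.callbacks.EarlyStopping(patience=10, restore_best_weights=True)
# model.fit([X_seq, X_proj], y, validation_data=..., batch_size=32,
#           epochs=100, callbacks=[stop])
```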
3.6 Validation
Model evaluation was conducted using a time-based framework that respects the chronological order of observations, thus avoiding any form of information leakage. Three periods were distinguished for each series: a training window, a validation window, and a test window, corresponding to the earliest historical weeks, the intermediate weeks, and the final weeks, respectively. Validation was used to select parameters and identify the best model, while the final test provides an unbiased estimate of predictive performance. The division was performed separately for each project, strictly maintaining the chronological order of the weeks.
To clarify the strategy adopted in temporally splitting the data, Table 2 shows an example of the chronological partition applied to three representative projects. For each project, all weekly observations were divided into three contiguous segments: an initial training period used to train the models, a 24-week validation block used for hyperparameter selection and early stopping activation, and a final 24-week testing block intended exclusively for evaluation on unseen future data. This strategy rigorously maintains the temporal order of events and avoids any form of data leakage, ensuring that the evaluation reflects a realistic forecasting scenario.
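A minimal sketch of this per-project chronological partition, assuming a consolidated multi-project frame (here called dataset) with a project column; all names are illustrative.

```python
import pandas as pd

VAL_WEEKS, TEST_WEEKS = 24, 24

def temporal_split(frame: pd.DataFrame):
    """Split one project's weekly frame into contiguous train/val/test blocks."""
    train = frame.iloc[:-(VAL_WEEKS + TEST_WEEKS)]
    val = frame.iloc[-(VAL_WEEKS + TEST_WEEKS):-TEST_WEEKS]
    test = frame.iloc[-TEST_WEEKS:]
    return train, val, test

# Global setting: split each project separately, then concatenate the blocks,
# preserving chronology within every project.
parts = [temporal_split(g) for _, g in dataset.groupby("project")]
train = pd.concat(p[0] for p in parts)
val = pd.concat(p[1] for p in parts)
test = pd.concat(p[2] for p in parts)
```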
Performance was measured using regression metrics widely used in forecasting: Mean Absolute Error (MAE), which quantifies the average error in original units (hours); Root Mean Squared Error (RMSE), which penalizes large errors more; and Mean Absolute Percentage Error (MAPE), which expresses the average relative deviation as a percentage, facilitating comparison between projects of different scales.
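For reference, denoting by $y_t$ the observed and by $\hat{y}_t$ the predicted weekly mean over the $n$ test weeks, the three metrics take their standard forms:

$$\mathrm{MAE} = \frac{1}{n}\sum_{t=1}^{n} |y_t - \hat{y}_t|, \qquad \mathrm{RMSE} = \sqrt{\frac{1}{n}\sum_{t=1}^{n} (y_t - \hat{y}_t)^2}, \qquad \mathrm{MAPE} = \frac{100}{n}\sum_{t=1}^{n} \left|\frac{y_t - \hat{y}_t}{y_t}\right|.$$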
Alongside the regression analysis, a binary classification analysis was also conducted, in which the mean resolution time values were divided into “high” and “low” according to data-driven thresholds (median or quantile). In this scenario, the models were evaluated in terms of accuracy, precision, recall, and F1 score, to capture their ability to distinguish more or less costly maintenance situations.
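A brief sketch of this binarization, assuming held-out predictions y_pred against targets y_test and using the training median as the data-driven threshold (one of the options mentioned above):

```python
import numpy as np
from sklearn.metrics import accuracy_score, precision_score, recall_score, f1_score

# Binarize against a data-driven threshold; the training median is one choice.
threshold = np.median(y_train)
true_high = y_test > threshold
pred_high = y_pred > threshold

scores = {
    "accuracy": accuracy_score(true_high, pred_high),
    "precision": precision_score(true_high, pred_high),
    "recall": recall_score(true_high, pred_high),
    "f1": f1_score(true_high, pred_high),
}
```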
This dual approach (regression and classification) allows us to assess the models' performance both in providing accurate quantitative estimates and in supporting decision-making scenarios in which it is important to distinguish critical projects or periods from more manageable ones.
3.7 Explainability
A crucial aspect of this study concerns the ability to make predictive models interpretable, in line with the recommendations of recent literature on Explainable AI applied to time series (Rojat et al., 2021; Ozkan and Tosun, 2024). Although accuracy metrics provide a synthetic measure of performance, they are not sufficient to understand the reasons for predictions or to support reliable decisions in software maintenance processes.
For Random Forest-based models, Permutation Feature Importance was applied, which evaluates the impact of each variable on the predictive result by measuring the increase in error when the feature values are randomly permuted. This robust and intuitive technique allows us to identify the variables that most influence the average resolution time, such as the weekly backlog or the proportion of high-priority bugs.
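With scikit-learn, this analysis is a single call; the sketch below assumes a fitted forest rf and the held-out test block (X_test, y_test) from the steps above.

```python
from sklearn.inspection import permutation_importance

# Increase in error when each feature is shuffled on the test block.
result = permutation_importance(
    rf, X_test, y_test,
    scoring="neg_mean_absolute_error",   # importance reads as an increase in MAE
    n_repeats=10, random_state=42,
)
ranking = sorted(zip(X_test.columns, result.importances_mean),
                 key=lambda pair: -pair[1])
```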
In the case of LSTM neural networks, interpretability was addressed using gradient-based methods, specifically saliency maps computed with respect to the input sequence. These maps highlight the weeks and exogenous variables most relevant to the forecast, allowing us to verify whether the model truly exploits the expected temporal patterns (seasonality, inertia, local anomalies). Furthermore, in the global model enriched with project embeddings, the vector representation learned for each project was analyzed to explore similarities and differences between projects in terms of maintenance behavior.
This dual approach to explainability, tailored to different types of models, not only increases forecast transparency but also provides useful insights for managing maintenance processes: for example, identifying the most critical variables or understanding how different projects share (or do not) common temporal patterns.
For the global LSTM model, saliency estimation was obtained via first-order gradient saliency, directly calculating the gradient of the prediction with respect to the input features. To this end, TensorFlow's automatic differentiation mechanism was used with a single pass of tf.GradientTape(). The final saliency associated with each feature was calculated as the average of the absolute value of the gradients over a random sample of 5,000 sequences drawn from the validation and test sets (or over the entire set when the number of available sequences was smaller). This configuration allows for a robust estimate of the local importance of the features while maintaining a low computational cost, even for the global model that aggregates heterogeneous sequences from different projects.
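A minimal sketch of this computation for the two-input global model (sequence plus project identifier); the function name and batch handling are our assumptions.

```python
import tensorflow as tf

def gradient_saliency(model, seq_batch, proj_batch):
    """Mean |d y_hat / d x| per input feature, via one tf.GradientTape pass."""
    x = tf.convert_to_tensor(seq_batch, dtype=tf.float32)
    with tf.GradientTape() as tape:
        tape.watch(x)                          # track gradients w.r.t. the input
        y_hat = model([x, proj_batch], training=False)
    grads = tape.gradient(y_hat, x)            # shape: (batch, window, n_features)
    # Average the absolute gradients over sampled sequences and time steps.
    return tf.reduce_mean(tf.abs(grads), axis=[0, 1]).numpy()

# Usage on a random sample of up to 5,000 validation/test sequences, as in the text:
# saliency = gradient_saliency(model, X_seq_sample, X_proj_sample)
```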
4 Results
This section reports the experimental results obtained for both local and global models, along with their explainability.
4.1 Local forecast
In the local configuration, each model was trained exclusively on data from a single project, reflecting the scenario in which teams develop customized solutions without cross-project collaboration. The average performance reported in Table 3 clearly shows the limitations and strengths of the different model families.
The naïve baseline, despite being conceptually simple (it uses the last week's value as a future prediction), paints an interesting picture. From the classification perspective, it achieves apparently acceptable values (Accuracy 0.65, F1 0.75), but its regression errors are extremely large (MAE over 2,300 h, MAPE over 400%). This indicates that the naïve model tends to predict well only when the series is stationary or free of sudden variations, but fails completely in the presence of large fluctuations, thus offering the illusion of good performance when evaluated with non-robust metrics.

The Random Forest emerges as the most effective model at the local level. With an average MAE of less than 700 h and a low MAPE of 36%, RF demonstrates a high ability to capture nonlinear relationships between backlog, priorities, and temporal dynamics. From a classification perspective, the Accuracy (0.92) and F1 (0.94) values also confirm the model's stability and generalization ability.

In contrast, LSTM networks do not show satisfactory performance in the local context: regression errors are very high (MAE about 3,128 h, MAPE about 211%) and classification metrics are particularly low (Accuracy 0.31, F1 0.22). This result reflects the difficulty of sequential architectures in learning robust patterns from relatively short and noisy time series, such as those from a single project. In practice, the LSTM fails to generalize in the absence of sufficiently large and varied data, suffering from overfitting or underutilization of its modeling capacity.
A qualitative analysis of individual projects also suggests that naïve models perform better in cases with high temporal stability, while Random Forests adapt well even to projects characterized by strong variability. LSTMs, while potentially powerful, require pooling of multiple series (global scenario) to exploit shared patterns. Overall, the local results demonstrate that, although simple baselines and classical models provide robust performance in individual contexts, deep learning models require multi-project approaches to fully realize their potential. This finding directly motivates the global analysis.
A more detailed analysis of the results by project, reported in Table 4, confirms the picture emerging from the average performance, but also highlights significant differences between the various systems. For some projects with relatively stable data sets, such as Firefox, Thunderbird, and Spark, the naïve model already achieves good results, with MAPE values below 60% and high classification metrics. In these cases, the temporal persistence of the resolution process makes the simple projection of the last value a surprisingly competitive approach. However, Random Forest still manages to further improve accuracy, reducing the mean error (MAE) to around 200 h in Spark and less than 400 h in Thunderbird, achieving near-perfect performance even in classification metrics (Accuracy and F1 equal to 1.00 in more than one case).

In the Cassandra, Hadoop, and HBase projects, which feature noisier and more irregular data sets, the differences between the models become more pronounced. Here, the naïve model shows clear limitations, with very high MAPE values (up to over 700%), while the Random Forest proves robust, maintaining lower errors (MAE in the range of 400–2,000 h depending on the project). LSTMs, on the other hand, fail to adapt adequately: in almost all cases, they report zero classification metrics and very high error values, a sign of an intrinsic difficulty in learning useful patterns from short, highly variable data sets when working at the single-project level.

Finally, VS Code represents an intermediate case: the naïve model achieves reasonable performance (MAE about 1,371 h, Accuracy = 0.75), but the Random Forest manages to reduce the average error by more than half while maintaining perfect classification. Here too, the LSTMs show no improvement, with errors exceeding 3,800 h.

Overall, the project-specific results reinforce the evidence that Random Forests are the most reliable model in the local setting, adapting well to both more stable and more variable contexts. Naïve baselines remain useful only in scenarios with strong temporal inertia, while sequential deep learning models systematically suffer from data scarcity and fail to exploit their potential in the absence of multi-project pooling.
4.2 Global forecast
In the global configuration, a single model was trained by jointly exploiting all the projects' time series, including the project encoding as a feature (one-hot for Random Forest, embedding for LSTM). It is important to note that, in the global configuration, the naïve baseline was not reported. This is because the naïve model, by its nature, simply predicts that the next value coincides with the last observed value in the single time series. This approach makes sense in the local configuration, where comparison with more complex models allows us to assess whether they exceed simple persistence. However, in the global configuration, the naïve model could not exploit any cross-project information, effectively reducing itself to an independent local prediction for each project. For this reason, the global results only report models capable of learning shared regularities across projects, such as Random Forest and LSTM.
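A sketch of the global Random Forest configuration, assuming the concatenated training block train from the temporal split described above and a categorical project column; the 600 trees follow the setup in Section 3.5.

```python
import pandas as pd
from sklearn.ensemble import RandomForestRegressor

# Project identity enters the global model as one-hot columns.
X_train = pd.get_dummies(train.drop(columns=["mean_resolution_hours"]),
                         columns=["project"])
y_train = train["mean_resolution_hours"]

rf = RandomForestRegressor(n_estimators=600, random_state=42)  # 600 trees (global)
rf.fit(X_train, y_train)
```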
The average results on the test set show a clear difference between the two families of models as reported in Table 5.
The global Random Forest model achieves significantly better performance, with an average MAE of approximately 612 h and a MAPE around 55%, while maintaining high classification accuracy (0.91) and an F1 score above 0.93. This suggests that, despite their simplicity, tree-based ensembles can robustly capture cross-project regularities. In contrast, the global LSTM model, while theoretically benefiting from the ability to model long-term temporal dependencies, exhibits very high errors (MAE about 2,377 h, MAPE over 500%), with lower classification performance (Accuracy 0.73, F1 0.82). This indicates that, at least in our setting and with the size of the available dataset, sequential architectures failed to generalize effectively in predicting resolution time.
Overall, the results confirm the effectiveness of global Random Forest-based models for cross-project forecasting of the mean resolution time, while highlighting critical issues in the direct use of sequential deep learning models without further optimization or regularization strategies.
4.2.1 Explainability results
To interpret the global models, we adopted explainability techniques consistent with their nature: Permutation Importance for the Random Forest (measuring the increase in error, $\Delta\mathrm{MAE}$, when a feature is permuted) and gradient-based saliency maps for the LSTM (estimating the average influence $|\partial \hat{y}/\partial x|$ of the features along the sequence). Figure 1 (RF vs. LSTM) and Tables 6, 7 report the top 10 attributes on the test set.
For the global Random Forest, a clear dominance of robust statistics of the resolution time distribution emerges: the 90th percentile (p90_resolution_hours) is by far the most important predictor, followed by the median. Aggregate target memories (rolling mean/std over 4–12-week windows) and a workload signal (backlog) round out the top predictors. This pattern indicates that the tree ensemble primarily leverages stable tail and historical variability indicators to predict the weekly mean, proving reliable in cross-project contexts.
The global LSTM, on the other hand, builds the forecast by combining explicit target memories (lag1/lag2/lag4) with calendar cues (year), distribution tails (p90), and weekly workload (opened_med, backlog, resolved_high, resolved_pct_high). This profile suggests that the neural network jointly leverages temporal inertia and operational cues; however, the greater dispersion of importance across multiple features is consistent with the performance variability observed in the global results.
In summary, explainability confirms two complementary strategies: (i) the global RF favors a few robust and stable predictors (distribution and aggregate memories), with a more “rigid” but highly effective behavior; (ii) the global LSTM integrates short-term memory and exogenous cues, showing sensitivity to more factors in the maintenance process, but with more variable performance. This evidence is consistent with the quantitative results: in our setting, global RF achieves the best balance between accuracy and stability, while LSTM benefits from multi-project training but does not always generalize uniformly across systems.
5 Conclusions
This work addressed the problem of forecasting bug resolution times in open-source projects by comparing classical machine learning models and sequential deep learning models, both in local (single-project) and global (cross-project) configurations. Using the GitBugs dataset, raw bug reports were transformed into homogeneous weekly time series and enriched with temporal and process characteristics, allowing for a systematic evaluation of the prediction models.

The results showed that, at the local level, Random Forest consistently outperforms both the naïve baseline and deep sequential models, ensuring robust performance even on projects with marked variability. However, the global analysis highlights the additional benefits of cross-project learning: the global Random Forest offers the best tradeoff between accuracy and stability, whereas the global LSTM, though less competitive in terms of mean error, captures complementary temporal dynamics and derives additional value from explainability analyses.

The integration of explainable AI techniques further strengthened the contribution of this work. Importance analysis using permutation importance confirmed the central role of distributional statistics and backlog dynamics in tree models, while saliency maps for the LSTM highlighted a greater dependence on time lags, load indicators, and seasonal signals. This dual interpretative approach not only increases model transparency but also provides useful insights for managing maintenance processes.
One aspect that clearly emerged concerns the limited performance of the LSTM model, especially in the local configuration. This finding is consistent with what is reported in the literature on time series applied to software engineering: LSTMs require large amounts of homogeneous sequential data to effectively learn stable temporal dependencies. In our weekly series, each project has a relatively small number of observations and exhibits irregular maintenance dynamics, characterized by sudden fluctuations in bug openings and closures. This combination makes it difficult for the network to fully exploit its long-term memory and promotes unstable behavior, with a high risk of overfitting.

Furthermore, weekly granularity reduces the temporal richness of the signal: short-term patterns–intraweekly fluctuations, bursts of activity, sudden buildups of backlog–are compressed into aggregate indicators, resulting in the LSTM receiving an already "smoothed" and less informative input signal compared to scenarios with daily or issue-level data. Rolling statistics and lags help compensate for this loss of detail, but they are not sufficient to make the sequence fully descriptive. Moreover, in local models, the network cannot benefit from the experience of multiple projects: each series, alone, does not offer enough heterogeneity to guide the learning of shared patterns. This explains why the global LSTM, while not outperforming the Random Forest, performs better than its local version.

Looking ahead, these limitations open up interesting directions for future work. The adoption of attention-based or Transformer-based architectures for time series could reduce the dependence on sequence length, favoring the dynamic selection of the most relevant information. Multi-resolution approaches, combining daily, weekly, and longer-term signals, would allow for the simultaneous capture of rapid changes and underlying trends. Finally, data augmentation techniques for time series, hybrid tabular-sequential models, and cross-project transfer learning strategies could further strengthen the model's ability to generalize even to relatively small datasets, typical of the software maintenance domain.
Further evidence emerging from the results concerns the robustness of the Random Forest in the global configuration. The heterogeneity of patterns observed across projects–irregular backlog fluctuations, variations in weekly openings and closings, differences in priorities and distributions of fix times–translates into an advantage for tree ensembles. The forest, in fact, builds each tree on different subsets of data and features: the greater the variety of configurations observed during training, the greater the structural diversification of the trees and the more stable the aggregate prediction. In this sense, multi-project data functions as a natural form of data augmentation, reducing variance and increasing the model's ability to generalize even to dynamics not perfectly observed in the single project. This explains why the global Random Forest maintains robust and consistent behavior, outperforming both the corresponding local version and the LSTM model, which is more sensitive to the limited length of the series and the lack of temporal regularity.
Some limitations remain to be considered. First, the dataset, although large and diverse, is limited to open-source projects and may not fully reflect the dynamics of industrial contexts. Second, sequential models suffered from data scarcity, suggesting that larger datasets or advanced architectures (such as Transformers or hybrid ensembles) could lead to significant improvements. Finally, the analysis was conducted on weekly aggregated series: extensions to the daily or issue-level could further enrich the perspective.
The implications that have emerged highlight how global models represent a promising direction for forecasting in software maintenance, as they exploit shared regularities across projects and increase robustness compared to purely local models. Furthermore, the integration of explainability tools ensures that forecasts are not only accurate but also interpretable, a key requirement for practical adoption in real-world scenarios.
Another important aspect concerns the computational costs associated with the different models. In our experiments, the local approach has modest training times: both the naïve baseline and the classical models (regression and Random Forest) train quickly on each project, thanks to the small size of the time series. However, complexity increases when switching to the global configuration, where all the time series are concatenated or processed jointly. In this scenario, the global Random Forest remains efficient, with training times on the order of a few minutes, while deep learning models (global LSTM) require significantly longer training times, due both to sequence generation and the greater number of parameters to optimize. In professional contexts, the difference between local and global models therefore translates into a practical trade-off: global models require longer training times, but offer more stable behavior and greater generalization ability; local models are lighter and easily retrained, but are more sensitive to the variability of the individual project. These considerations are useful for those who intend to integrate bug-resolution time forecasting into continuous maintenance pipelines, where the balance between predictive accuracy and computational sustainability becomes crucial.
For future research, we intend to explore hybrid strategies that combine tree ensembles and sequential models, as well as extensions toward multimodal forecasting that integrates textual and process metadata. Another line of research concerns federated or privacy-preserving approaches, which allow predictive knowledge to be shared between organizations without exposing sensitive data.
Data availability statement
The original contributions presented in the study are included in the article/supplementary material, further inquiries can be directed to the corresponding author.
Author contributions
LA: Conceptualization, Formal analysis, Supervision, Validation, Writing – review & editing. MI: Conceptualization, Data curation, Methodology, Writing – original draft, Writing – review & editing. AM: Conceptualization, Data curation, Visualization, Writing – original draft. FP: Conceptualization, Supervision, Validation, Writing – original draft.
Funding
The author(s) declared that financial support was not received for this work and/or its publication.
Conflict of interest
The author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The author(s) declared that generative AI was not used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher's note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
Bandara, K., Bergmeir, C., and Smyl, S. (2020). Forecasting across time series databases using recurrent neural networks on groups of similar series: a clustering approach. Expert Syst. Appl. 140:112896. doi: 10.1016/j.eswa.2019.112896
Bhattacharya, P., and Neamtiu, I. (2011). “Bug-fix time prediction models: can we do better?,” in Proceedings of the 8th Working Conference on Mining Software Repositories (MSR) (New York, NY: ACM), 207–210. doi: 10.1145/1985441.1985472
De Luca, M., Fasolino, A. R., Ferraro, A., Moscato, V., Sperlí, G., and Tramontana, P. (2023). A community detection approach based on network representation learning for repository mining. Expert Syst. Appl. 231:120597. doi: 10.1016/j.eswa.2023.120597
Ferraro, A., Galli, A., La Gatta, V., and Postiglione, M. (2023). Benchmarking open source and paid services for speech to text: an analysis of quality and input variety. Front. Big Data 6:1210559. doi: 10.3389/fdata.2023.1210559
Ghotra, B. S., McIntosh, S., and Hassan, A. E. (2015). “Revisiting the impact of classification techniques on the performance of defect prediction models,” in Proceedings of the 37th International Conference on Software Engineering (ICSE) (Florence: IEEE), 789–800. doi: 10.1109/ICSE.2015.91
Hochreiter, S., and Schmidhuber, J. (1997). Long short-term memory. Neural Comput. 9, 1735–1780. doi: 10.1162/neco.1997.9.8.1735
Javed, A., Iqbal, M. Z., Ferzund, J., et al. (2024). Cross-project defect prediction using optimal transport for domain adaptation. Autom. Softw. Eng. 31, 1–29. doi: 10.3390/a17050175
Kim, H., Kim, H., Kim, H., Lee, D., and Yoon, S. (2025). A comprehensive survey of deep learning for time series. Artif. Intell. Rev. 58:216. doi: 10.1007/s10462-025-11223-9
Lim, B., and Zohren, S. (2021). Time-series forecasting with deep learning: a survey. Philos. Trans. R. Soc. A 379:20200209. doi: 10.1098/rsta.2020.0209
Liu, Y., Chen, R., and Xu, X. (2024). Deep learning for time series forecasting: a survey. Mathematics 12, 223–247. doi: 10.3390/math12101504
Makridakis, S., Spiliotis, E., and Assimakopoulos, V. (2020). The M4 Competition: 100,000 time series and 61 forecasting methods. Int. J. Forecast. 36, 54–74. doi: 10.1016/j.ijforecast.2019.04.014
Montero-Manso, P., Hyndman, R. J., Athanasopoulos, G., and Talagala, T. S. (2020). FFORMA: feature-based forecast model averaging. Int. J. Forecast. 36, 86–92. doi: 10.1016/j.ijforecast.2019.02.011
Nastos, K., and Chatzigeorgiou, A. (2025a). Interpretable prediction of issue resolution time. Inf. Softw. Technol., in press. doi: 10.48550/arXiv.2505.01108
Nastos, K., and Chatzigeorgiou, A. (2025b). Towards an interpretable analysis for estimating the time of resolution of software issues. arXiv [preprint]. arXiv:2505.01108.
Noyori, Y., Washizaki, H., Fukazawa, Y., Ooshima, K., Kanuka, H., and Nojiri, S. (2023). Deep learning and gradient-based extraction of bug report features. Front. Comput. Sci. 5:1032440. doi: 10.3389/fcomp.2023.1032440
Ozkan, H. Y., Kellerer, W., Mas-Machuca, C., and Damosso, E. (2024). Bug analysis towards bug resolution time prediction. arXiv [preprint]. arXiv:2407.21241. doi: 10.48550/arXiv.2407.21241
Ozkan, O., and Tosun, A. (2024). “Predicting issue resolution time with machine learning: an industrial case study,” in Proceedings of the 30th International Conference on Software Analysis, Evolution and Reengineering (SANER), 654–664.
Patil, A. (2025a). GitBugs: Bug Reports From Some Popular Open Source Projects [Data set]. GitHub. Available online at: https://github.com/av9ash/gitbugs
Patil, A. (2025b). GitBugs: Bug reports for duplicate detection, retrieval augmented generation, triage, and more. arXiv preprint arXiv:2504.09651. doi: 10.48550/arXiv.2504.09651
Patil, A., Moshfegh, H. R., and Azizi, K. (2025). GitBugs: a benchmark dataset for bug report analysis. arXiv [preprint]. arXiv:2501.01234. doi: 10.48550/arXiv.2501.01234
Pressman, R. S. (2010). Software Engineering: A Practitioner's Approach, 7th Edn. New York, NY: McGraw-Hill.
Raharja, S. (2023). Performance evaluation on resolution time prediction using machine learning. J. Inf. Vocat. Educ. 6, 341–348. doi: 10.62527/joiv.8.2.2305
Rojat, T., Puget, R., Filliat, D., Del Ser, J., Gelin, R., and Díaz-Rodríguez, N. (2021). Explainable artificial intelligence (XAI) on time series data: a survey. arXiv [preprint]. arXiv:2104.00950. doi: 10.48550/arXiv.2104.00950
Turhan, B., Menzies, T., Bener, A. B., and Di Stefano, J. (2009). On the relative value of cross-company and within-company data for defect prediction. Empir. Softw. Eng. 14, 540–578. doi: 10.1007/s10664-008-9103-7
Wang, H., Zhang, T., and Zhao, H. (2024). Feddpi: federated cross-project defect prediction with knowledge transfer. Empir. Softw. Eng. 29, 1–28.
Weiss, M. (2022). “On the use of machine learning for predicting defect fix time,” in Proceedings of the 14th International Conference on Evaluation of Novel Approaches to Software Engineering (ENASE) (SCITEPRESS), 181–190.
Zhang, H., and Kim, S. (2013). "Predicting bug-fixing time: an empirical study of commercial software projects," in Proceedings of the 2013 International Conference on Software Engineering (ICSE) (San Francisco, CA: IEEE), 1042–1051. doi: 10.1109/ICSE.2013.6606654
Zimmermann, T., Nagappan, N., Gall, H., Giger, E., and Murphy, B. (2009). “Cross-project defect prediction: a large scale experiment on data vs. domain vs. process,” in Proceedings of the 7th Joint Meeting of the European Software Engineering Conference and the ACM SIGSOFT Symposium on the Foundations of Software Engineering (ESEC/FSE) (New York, NY: ACM), 91–100. doi: 10.1145/1595696.1595713
Keywords: time series forecasting, bug resolution, machine learning and deep learning models, explainable artificial intelligence, software maintenance
Citation: Aversano L, Iammarino M, Madau A and Pecorelli F (2025) Time series forecasting for bug resolution using machine learning and deep learning models. Front. Big Data 8:1745751. doi: 10.3389/fdata.2025.1745751
Received: 13 November 2025; Revised: 25 November 2025;
Accepted: 28 November 2025; Published: 19 December 2025.
Edited by: Antonio Galli, University of Naples Federico II, Italy

Reviewed by: Diego Russo, University of Bergamo, Italy; Gian Marco Orlando, University of Naples Federico II, Italy
Copyright © 2025 Aversano, Iammarino, Madau and Pecorelli. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Martina Iammarino, martina.iammarino@unipegaso.it