REVIEW article

Front. Ind. Eng.

Sec. Industrial Informatics

Volume 3 - 2025 | doi: 10.3389/fieng.2025.1611512

This article is part of the Research TopicLearning-driven Optimization for Solving Scheduling and LogisticsView all 5 articles

Multi-agent Reinforcement Learning for Flexible Shop Scheduling Problem: A Survey

Provisionally accepted
  • 1College of Information Science and Engineering, Henan University of Technology, Zhengzhou, Henan Province, China
  • 2School of Artificial Intelligence and Big Data, Henan University of Technology, Zhengzhou, China
  • 3Research Institute for Science and Technology, Tokyo University of Science, Tokyo, Japan
  • 4Faculty of Science and Technology, Tokyo University of Science, Tokyo, Japan

The final, formatted version of the article will be published soon.

This paper presents a systematic and comprehensive review of multi-agent reinforcement learning (MARL) methodologies and their applications in addressing the flexible shop scheduling problem (FSSP), a fundamental yet challenging optimization paradigm in contemporary manufacturing systems. While conventional optimization approaches exhibit limitations in handling the inherent multi-resource constraints, dynamics and stochastic characteristics of real-world FSSP scenarios, MARL has emerged as a promising alternative framework, particularly due to its capability to effectively manage complex, decentralized decision-making processes in dynamic environments. Through a rigorous analytical framework, this study synthesizes and evaluates the current state-of-the-art MARL implementations in FSSP contexts, encompassing critical aspects such as problem formulation paradigms, agent architectural designs, learning algorithm frameworks, and inter-agent coordination mechanisms. We conduct an in-depth examination of the fundamental challenges inherent in MARL applications to FSSP, including the optimization of state-action space representations, the design of effective reward mechanisms, and the resolution of scalability constraints. Furthermore, this review provides a comparative analysis of diverse MARL paradigms, including centralized training with decentralized execution, fully decentralized approaches, and hierarchical methodologies, critically evaluating their respective advantages and limitations within the FSSP domain. The study culminates in the identification of significant research gaps and promising future research directions, with particular emphasis on theoretical foundations and practical implementations. This comprehensive review serves as an authoritative reference for researchers and practitioners in the field, providing a robust theoretical foundation and practical insights for advancing the application of MARL in flexible shop scheduling and related manufacturing optimization domains. The findings presented herein contribute to the broader understanding of intelligent manufacturing systems and computational optimization in Industry 4.0 contexts.

Keywords: Flexible shop scheduling problem, Flexible Job-shop Scheduling Problem, hybrid flow-shop scheduling problem, Multi-agent reinforcement learning, reinforcement learning

Received: 14 Apr 2025; Accepted: 14 Jul 2025.

Copyright: © 2025 Xu, Gu, Zhang, Gen and Ohwada. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Wenqiang Zhang, School of Artificial Intelligence and Big Data, Henan University of Technology, Zhengzhou, China

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.