ORIGINAL RESEARCH article
Front. Future Transp.
Sec. Freight Transport and Logistics
Volume 6 - 2025 | doi: 10.3389/ffutr.2025.1603726
This article is part of the Research TopicInnovations and Challenges in Freight Transportation: Navigating the Future of Global LogisticsView all articles
Adaptive Vehicle Routing for Humanitarian Aid in Conflict-Affected Regions: A Practitioner-Informed Deep Reinforcement Learning Approach
Provisionally accepted- 1King Faisal University, Al-Ahsa, Saudi Arabia
- 2University of Sousse, Sousse, Tunisia
Select one of your emails
You have multiple emails registered with Frontiers:
Notify me on publication
Please enter your email address:
If you already have an account, please login
You don't have a Frontiers account ? You can register here
Humanitarian aid delivery in conflict-affected regions faces significant challenges due to dynamic security risks, uncertain demand, and complex operational constraints. Traditional optimization methods struggle with computational intractability and lack adaptability for real-time decision-making in volatile environments. To address these limitations, we propose a novel hybrid framework that integrates Deep Reinforcement Learning (DRL) with Graph Neural Networks (GNNs) and deterministic constraint validation, informed by practitioner insights to ensure real-world applicability. Our approach employs Proximal Policy Optimization (PPO) enhanced by GNN-based spatial representations to learn adaptive, efficient vehicle routing policies under uncertainty. A post-decision validation mechanism enforces feasibility by penalizing constraint violations based on a deterministic equivalent model. We evaluate our method on realistic, georeferenced datasets reflecting Afghan road networks and conflict data, comparing it against classical PPO and heuristic baselines. Results demonstrate that PPO-GNN significantly reduces operational costs (by 7.9%), security risk exposure (by 15.2%), and unmet demand, while improving reliability and adherence to constraints. The approach scales effectively across network sizes and maintains robustness under stochastic variations in demand and security conditions. Our framework balances computational efficiency with practical relevance, aligning with humanitarian priorities and offering a promising decision-support tool for aid logistics in conflict zones.
Keywords: Humanitarian Aid Delivery, Conflict Zone Logistics, deep reinforcement learning, Graph neural networks, Proximal policy optimization, Adaptive vehicle routing, Practitioner-Informed Modeling
Received: 31 Mar 2025; Accepted: 09 Sep 2025.
Copyright: © 2025 MILI and Argoubi. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
* Correspondence: Khaled MILI, King Faisal University, Al-Ahsa, Saudi Arabia
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.