Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Future Transp.

Sec. Freight Transport and Logistics

Volume 6 - 2025 | doi: 10.3389/ffutr.2025.1603726

This article is part of the Research TopicInnovations and Challenges in Freight Transportation: Navigating the Future of Global LogisticsView all articles

Adaptive Vehicle Routing for Humanitarian Aid in Conflict-Affected Regions: A Practitioner-Informed Deep Reinforcement Learning Approach

Provisionally accepted
  • 1King Faisal University, Al-Ahsa, Saudi Arabia
  • 2University of Sousse, Sousse, Tunisia

The final, formatted version of the article will be published soon.

Humanitarian aid delivery in conflict-affected regions faces significant challenges due to dynamic security risks, uncertain demand, and complex operational constraints. Traditional optimization methods struggle with computational intractability and lack adaptability for real-time decision-making in volatile environments. To address these limitations, we propose a novel hybrid framework that integrates Deep Reinforcement Learning (DRL) with Graph Neural Networks (GNNs) and deterministic constraint validation, informed by practitioner insights to ensure real-world applicability. Our approach employs Proximal Policy Optimization (PPO) enhanced by GNN-based spatial representations to learn adaptive, efficient vehicle routing policies under uncertainty. A post-decision validation mechanism enforces feasibility by penalizing constraint violations based on a deterministic equivalent model. We evaluate our method on realistic, georeferenced datasets reflecting Afghan road networks and conflict data, comparing it against classical PPO and heuristic baselines. Results demonstrate that PPO-GNN significantly reduces operational costs (by 7.9%), security risk exposure (by 15.2%), and unmet demand, while improving reliability and adherence to constraints. The approach scales effectively across network sizes and maintains robustness under stochastic variations in demand and security conditions. Our framework balances computational efficiency with practical relevance, aligning with humanitarian priorities and offering a promising decision-support tool for aid logistics in conflict zones.

Keywords: Humanitarian Aid Delivery, Conflict Zone Logistics, deep reinforcement learning, Graph neural networks, Proximal policy optimization, Adaptive vehicle routing, Practitioner-Informed Modeling

Received: 31 Mar 2025; Accepted: 09 Sep 2025.

Copyright: © 2025 MILI and Argoubi. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Khaled MILI, King Faisal University, Al-Ahsa, Saudi Arabia

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.