AUTHOR=Mou Jianhui , Shi Bo , Wang Bo , Yu Chengcheng , Wang Yangwei , Zhong Fusheng , Zheng Li , Wang Jian , Li Junjie 

TITLE=A novel reinforcement learning framework-based path planning algorithm for unmanned surface vehicle

JOURNAL=Frontiers in Marine Science

VOLUME=Volume 12 - 2025

YEAR=2025

URL=https://www.frontiersin.org/journals/marine-science/articles/10.3389/fmars.2025.1641093

DOI=10.3389/fmars.2025.1641093

ISSN=2296-7745

ABSTRACT=Unmanned surface vehicles (USVs) nowadays have been widely used in ocean observation missions, helping researchers to monitor climate change, collect environmental data, and observe marine ecosystem processes. However, path planning for USVs often faces several inherent difficulties during ocean observation missions: high dependence on environmental information, long convergence time, and low-quality generated paths. To solve these problems, this article proposes a novel artificial potential field-heuristic reward-averaging deep Q-network (APF-RADQN) framework-based path planning algorithm, aiming at finding optimal paths for USVs. First, the USV path planning is modeled as a Markov decision process (MDP). Second, a comprehensive reward function incorporating artificial potential field (APF) inspiration is designed to guide the USV to reach the target region. Subsequently, an optimized deep neural network with a reward-averaging strategy is constructed to effectively enhance the learning and convergence speed of the algorithm, thus further improving the global search capability and interface performance of USV path planning. In addition, the Bezier curve is applied to make the generated path more feasible. Finally, the effectiveness of the proposed algorithm is verified by comparing it with the DQN, A*, and APF algorithms in simulation experiments. Simulation results demonstrate that the APF-RADQN improves the interface ability and path quality, significantly enhancing the USV navigation safety and ocean observation mission operation efficiency.