Research Area:  Machine Learning
In this paper, a novel multi-USV formation path planning algorithm is proposed based on deep reinforcement learning. First, a goal-based hierarchical reinforcement learning algorithm is designed to improve training speed and resolve planning conflicts within the formation. Second, an improved artificial potential field algorithm is designed in the training process to obtain the optimal path planning and obstacle avoidance learning scheme for multi-USVs in the determined perceptual environment. Finally, a formation geometry model is established to describe the physical relationships among USVs, and a composite reward function is proposed to guide the training. Numerous simulation tests are conducted, and the effectiveness of the proposed algorithm are further validated through the NEU-MSV01 experimental platform with a combination of parameterized Line of Sight (LOS) guidance.
Keywords:  
Author(s) Name:  Xiangwei Wei , Hao Wang , Yixuan Tang
Journal name:  Ocean Engineering
Conferrence name:  
Publisher name:  ScienceDirect
DOI:  https://doi.org/10.1016/j.oceaneng.2023.115577
Volume Information:  Volume 286,(2023)
Paper Link:   https://www.sciencedirect.com/science/article/abs/pii/S0029801823019613