复杂海洋环境中基于改进DDPG算法的船舶智能路径规划

刘燊; 杨德庆

doi:10.19693/j.issn.1673-3185.04362

复杂海洋环境中基于改进DDPG算法的船舶智能路径规划

刘燊,
杨德庆

Ship path planning based on improved DDPG algorithm in complex marine environment

摘要

摘要:
目的为增强船舶在复杂海洋环境中的路径规划与避障能力，提高船舶航行的经济性与安全性，提出一种基于改进DDPG算法的方法。
方法通过引入路径重要性评分优先经验回放、注意力机制及对抗架构，对算法的经验利用、特征感知、价值评估环节进行优化，而从提升算法性能。
结果在东海、印度洋等海域进行仿真测试，与DDPG和A*算法相比，改进算法在路径长度、拐点数量及碰撞次数方面均有显著优化。在东海海域，改进算法相较于DDPG算法，路径长度减少了0.75%，拐点数量减少了26.92%，碰撞次数减少了15.80%；相较于A*算法，路径长度减少了4.59%，拐点数量减少了42.42%。
结论改进算法在不同复杂度的海洋环境中均表现出优于DDPG算法和传统A*算法的性能，证明其优势显著、普适性强，可为船舶航行的智能化决策提供参考。

Abstract:
Objectives To enhance ship path planning and obstacle avoidance in complex marine environments while improving the efficiency and safety of ship navigation, this study proposes a novel method based on an improved DDPG algorithm.
Methods A priority experience replay mechanism, guided by a path importance score, is introduced to enhance the utilization efficiency of important experience in the learning process. A self-attention mechanism is integrated into the actor-critic network to enhance its ability to capture environmental features. In addition, the network architecture is optimized by using the dueling deep Q-network to improve the accuracy of value function estimation.
Results Simulation results in the East China Sea and the Indian Ocean show that, compared with the DDPG and A* algorithms, the improved algorithm achieves significant improvements in path length, inflection points and collision avoidance. For example, in the East China Sea, the improved algorithm reduces path length by 0.75%, inflection points by 26.92%, and collisions by 15.80% compared with the DDPG algorithm; and reduces path length by 4.59% and inflection points by 42.42% compared with the A* algorithm.
Conclusions The improved algorithm is superior to DDPG and traditional A* algorithms in marine environments of varying complexity, demonstrating its significant advantages and strong generalizability. It provides a reference for intelligent decision-making in ship navigation.

HTML全文

参考文献(20)

施引文献

资源附件(0)