复杂海洋环境中基于改进DDPG算法的船舶智能路径规划

Ship path planning based on improved DDPG algorithm in complex marine environment

  • 摘要:
    目的 为增强船舶在复杂海洋环境中的路径规划与避障能力,提高船舶航行的经济性与安全性,提出一种基于改进DDPG算法的方法。
    方法 通过引入路径重要性评分优先经验回放、注意力机制及对抗架构,对算法的经验利用、特征感知、价值评估环节进行优化,而从提升算法性能。
    结果 在东海、印度洋等海域进行仿真测试,与DDPG和A*算法相比,改进算法在路径长度、拐点数量及碰撞次数方面均有显著优化。在东海海域,改进算法相较于DDPG算法,路径长度减少了0.75%,拐点数量减少了26.92%,碰撞次数减少了15.80%;相较于A*算法,路径长度减少了4.59%,拐点数量减少了42.42%。
    结论 改进算法在不同复杂度的海洋环境中均表现出优于DDPG算法和传统A*算法的性能,证明其优势显著、普适性强,可为船舶航行的智能化决策提供参考。

     

    Abstract:
    Objectives To enhance ship path planning and obstacle avoidance in complex marine environments while improving the efficiency and safety of ship navigation, this study proposes a novel method based on an improved DDPG algorithm.
    Methods A priority experience replay mechanism, guided by a path importance score, is introduced to enhance the utilization efficiency of important experience in the learning process. A self-attention mechanism is integrated into the actor-critic network to enhance its ability to capture environmental features. In addition, the network architecture is optimized by using the dueling deep Q-network to improve the accuracy of value function estimation.
    Results Simulation results in the East China Sea and the Indian Ocean show that, compared with the DDPG and A* algorithms, the improved algorithm achieves significant improvements in path length, inflection points and collision avoidance. For example, in the East China Sea, the improved algorithm reduces path length by 0.75%, inflection points by 26.92%, and collisions by 15.80% compared with the DDPG algorithm; and reduces path length by 4.59% and inflection points by 42.42% compared with the A* algorithm.
    Conclusions The improved algorithm is superior to DDPG and traditional A* algorithms in marine environments of varying complexity, demonstrating its significant advantages and strong generalizability. It provides a reference for intelligent decision-making in ship navigation.

     

/

返回文章
返回