尼洪涛, 周清基, 柴松, 等. 基于强化学习的成品油船装载方案自主生成技术研究[J]. 中国舰船研究, 2023, 18(增刊 1): 1–10. doi: 10.19693/j.issn.1673-3185.03474
引用本文: 尼洪涛, 周清基, 柴松, 等. 基于强化学习的成品油船装载方案自主生成技术研究[J]. 中国舰船研究, 2023, 18(增刊 1): 1–10. doi: 10.19693/j.issn.1673-3185.03474
NI H T, ZHOU Q J, CHAI S, et al. Reinforcement learning-based autonomous generation technology of loading scheme for product oil tanker[J]. Chinese Journal of Ship Research, 2023, 18(Supp 1): 1–10. doi: 10.19693/j.issn.1673-3185.03474
Citation: NI H T, ZHOU Q J, CHAI S, et al. Reinforcement learning-based autonomous generation technology of loading scheme for product oil tanker[J]. Chinese Journal of Ship Research, 2023, 18(Supp 1): 1–10. doi: 10.19693/j.issn.1673-3185.03474

基于强化学习的成品油船装载方案自主生成技术研究

Reinforcement learning-based autonomous generation technology of loading scheme for product oil tanker

  • 摘要:
      目的  旨在基于强化学习方法研究液货舱装载方案自主生成技术。
      方法  以实际运营的成品油船载货量作为输入,以货舱及压载舱的装载率为目标,基于Unity ML-Agents构建智能体与环境,通过PyTorch框架对智能体进行训练,提出一种综合考虑装载时间与纵倾变化幅度的奖励函数计算方法,并以算例分析来验证所提方法的有效性。
      结果  结果显示,所训练的智能体能够学习良好的策略,并实现液货舱装载方案的自主生成。
      结论  研究结果表明,将强化学习用于解决多目标条件下的液货舱装载方案自主生成是合理可行的。

     

    Abstract:
      Objectives  This paper aims to study the automatic generation technology of loading and unloading schemes for the liquid cargo tank of oil tanker based on reinforcement learning.
      Methods  Using the cargo capacity of an actual operating oil tanker as input and the loading rates of the cargo tank and ballast water tank as the targets, an intelligent agent and environment were built based on Unity ML-Agents. The agent was trained using the PyTorch framework, and a reward function calculation method that comprehensively considers the loading time and the change in the trim amplitude was proposed. Finally, the example analysis were carried out to validate the validity of the proposed method.
      Results  The results show that, the trained agent can learn good strategies and achieve autonomous generation of liquid cargo tank loading schemes.
      Conclusions  The study showed that applying reinforcement learning to solve the problem of autonomous generation of liquid cargo tank loading schemes under multi-objective conditions is reasonable and feasible.

     

/

返回文章
返回