Intelligent Smart Marine Autonomous Surface Ship Decision System Based on Improved PPO Algorithm

Cited by: 17
Authors
Guan, Wei [1]
Cui, Zhewen [1]
Zhang, Xianku [1]
Affiliations
[1] Dalian Maritime Univ, Nav Coll, Dalian 116026, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
decision-making; deep reinforcement learning; Nomoto; PPO; SMASS; collision avoidance
DOI
10.3390/s22155732
Chinese Library Classification (CLC) number
O65 [Analytical Chemistry]
Discipline classification codes
070302; 081704
Abstract
With the development of artificial intelligence technology, behavior decision-making for an intelligent smart marine autonomous surface ship (SMASS) has become particularly important. This research proposes a local path planning and behavior decision-making approach based on an improved Proximal Policy Optimization (PPO) algorithm, which can drive an unmanned SMASS to its target without requiring any human experience. In addition, generalized advantage estimation is added to the loss function of the PPO algorithm, which allows the baselines in PPO to be self-adjusted. First, the SMASS is modeled with the Nomoto model in a simulated waterway. Then, distances, obstacles, and prohibited areas are regularized as rewards or punishments, which are used to judge the performance and manipulation decisions of the vessel. Subsequently, the improved PPO is introduced to learn the action-reward model, and the trained neural network model is used to manipulate the SMASS's movement. To achieve higher reward values, the SMASS can find an appropriate path or navigation strategy by itself. After a sufficient number of training rounds, a convincing path and manipulation strategies are likely to be produced. Compared with existing methods, the proposed approach is more effective in self-learning and continuous optimization and is thus closer to human manipulation.
Pages: 33
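
The abstract names three building blocks: a Nomoto ship model, generalized advantage estimation (GAE), and the PPO objective. The sketch below illustrates the textbook form of each under stated assumptions; it is not the authors' implementation, and the abstract does not specify how their improved loss is assembled. The function names and the values of K, T, dt, gamma, lam, and eps are illustrative placeholders, not parameters taken from the paper.

```python
def nomoto_step(psi, r, delta, K=0.2, T=10.0, dt=0.1):
    """One explicit-Euler step of the first-order Nomoto model T*r' + r = K*delta.

    psi: heading (rad), r: yaw rate (rad/s), delta: rudder angle (rad).
    K, T, dt are assumed example values, not the paper's ship parameters.
    """
    r_dot = (K * delta - r) / T
    return psi + dt * r, r + dt * r_dot


def gae(rewards, values, last_value, gamma=0.99, lam=0.95):
    """Generalized advantage estimates for one trajectory segment.

    values[t] is the critic's estimate V(s_t); last_value is V of the state
    after the final step (use 0.0 if the episode terminated there).
    Assumes no terminations inside the segment.
    """
    advantages = [0.0] * len(rewards)
    next_value = last_value
    running = 0.0
    for t in reversed(range(len(rewards))):
        # One-step temporal-difference error at time t.
        td_error = rewards[t] + gamma * next_value - values[t]
        # Exponentially weighted sum of current and future TD errors.
        running = td_error + gamma * lam * running
        advantages[t] = running
        next_value = values[t]
    return advantages


def clipped_surrogate(ratio, advantage, eps=0.2):
    """Standard PPO clipped objective for a single sample (to be maximized)."""
    clipped_ratio = max(1.0 - eps, min(1.0 + eps, ratio))
    return min(ratio * advantage, clipped_ratio * advantage)
```

In a training loop of this kind, `nomoto_step` would advance the simulated vessel for each rudder command, the collected rewards and critic values would be passed to `gae`, and the resulting advantages would weight `clipped_surrogate` when updating the policy. With `lam=0` the estimate reduces to the one-step TD error, and with `gamma=lam=1` and a zero critic it reduces to the reward-to-go.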