TRANSFER REINFORCEMENT LEARNING: FEATURE TRANSFERABILITY IN SHIP COLLISION AVOIDANCE

被引:0
|
作者
Wang, Xinrui [1 ]
Jin, Yan [1 ]
机构
[1] Univ Southern Calif, Dept Aerosp & Mech Engn, Los Angeles, CA 90007 USA
关键词
Artificial intelligence; deep learning; transfer learning; reinforcement learning; collision avoidance; RISK;
D O I
暂无
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
The integration of artificial intelligence into engineering work has become increasingly prevalent. Engineering work processes can be highly complex, and learning from scratch requires large computation resources. Transfer learning has emerged as a promising technique for improving learning efficiency by leveraging knowledge gained from related tasks to the target task. To achieve optimal performance, one of the key challenges is to figure out how transferrable the features are among different work processes and within training networks. Simulation-based ship collision avoidance is used for case studies due to its inherent complexity and diversity. Two transfer reinforcement learning methods, feature extraction, and finetuning, are implemented and evaluated against the baseline. Instead of introducing large-scaled pre-trained models as the backbone, a light CNN model pre-trained in a related base case has been proven to transfer essential features to target cases. Simplified ship dynamics is introduced into the training process to make it more realistic and applicable, and the delay caused by the large moment of inertia is addressed by modifying the model-environment interaction mechanism. Work process features for the ship collision avoidance process are concluded from crucial aspects. The effects on transferability are displayed by experimental results discussed from the feature category and similarity perspective.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] WORK PROCESS TRANSFER REINFORCEMENT LEARNING: FEATURE EXTRACTION AND FINETUNING IN SHIP COLLISION AVOIDANCE
    Wang, Xinrui
    Jin, Yan
    PROCEEDINGS OF ASME 2022 INTERNATIONAL DESIGN ENGINEERING TECHNICAL CONFERENCES AND COMPUTERS AND INFORMATION IN ENGINEERING CONFERENCE, IDETC-CIE2022, VOL 2, 2022,
  • [2] Knowledge transfer enabled reinforcement learning for efficient and safe autonomous ship collision avoidance
    Wang, Chengbo
    Wang, Ning
    Gao, Hongbo
    Wang, Leihao
    Zhao, Yizhuo
    Fang, Mingxing
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024, 15 (09) : 3715 - 3731
  • [3] Ship Collision Avoidance Using Constrained Deep Reinforcement Learning
    Zhang, Rui
    Wang, Xiao
    Liu, Kezhong
    Wu, Xiaolie
    Lu, Tianyou
    Chao Zhaohui
    2018 5TH INTERNATIONAL CONFERENCE ON BEHAVIORAL, ECONOMIC, AND SOCIO-CULTURAL COMPUTING (BESC), 2018, : 115 - 120
  • [4] Deep reinforcement learning-based collision avoidance for an autonomous ship
    Chun, Do-Hyun
    Roh, Myung-Il
    Lee, Hye-Won
    Ha, Jisang
    Yu, Donghun
    OCEAN ENGINEERING, 2021, 234
  • [5] Research on collision avoidance method of intelligent ship navigation based on reinforcement learning
    Yuan, Zhongmi
    Ma, Lei
    Liu, Xiaoqiu
    Zhang, Weibin
    2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 3220 - 3224
  • [6] Research on autonomous collision avoidance of merchant ship based on inverse reinforcement learning
    Zheng, Mao
    Xie, Shuo
    Chu, Xiumin
    Zhu, Tianquan
    Tian, Guohao
    INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2020, 17 (06)
  • [7] A composite learning method for multi-ship collision avoidance based on reinforcement learning and inverse control
    Xie, Shuo
    Chu, Xiumin
    Zheng, Mao
    Liu, Chenguang
    NEUROCOMPUTING, 2020, 411 (411) : 375 - 392
  • [8] Automatic ship collision avoidance using deep reinforcement learning with LSTM in continuous action spaces
    Sawada, Ryohei
    Sato, Keiji
    Majima, Takahiro
    JOURNAL OF MARINE SCIENCE AND TECHNOLOGY, 2021, 26 (02) : 509 - 524
  • [9] Ship cooperative collision avoidance strategy based on multi-agent deep reinforcement learning
    Sui L.-R.
    Gao S.
    He W.
    Kongzhi yu Juece/Control and Decision, 2023, 38 (05): : 1395 - 1402
  • [10] CONTROL METHOD FOR PATH FOLLOWING AND COLLISION AVOIDANCE OF AUTONOMOUS SHIP BASED ON DEEP REINFORCEMENT LEARNING
    Zhao, Luman
    Roh, Myung-Il
    Lee, Sung-Jun
    JOURNAL OF MARINE SCIENCE AND TECHNOLOGY-TAIWAN, 2019, 27 (04): : 293 - 310