Reinforcement Learning Based Underwater Wireless Optical Communication Alignment for Autonomous Underwater Vehicles

被引:9
|
作者
Weng, Yang [1 ]
Pajarinen, Joni [2 ,3 ]
Akrour, Riad [2 ]
Matsuda, Takumi [1 ,4 ]
Peters, Jan [2 ]
Maki, Toshihiro [1 ]
机构
[1] Univ Tokyo, Inst Ind Sci, Tokyo 1538505, Japan
[2] Tech Univ Darmstadt, Intelligent Autonomous Syst Lab, D-64289 Darmstadt, Germany
[3] Aalto Univ, Dept Elect Engn & Automat, Espoo 02150, Finland
[4] Meiji Univ, Sch Sci & Technol, Kawasaki, Kanagawa 2148571, Japan
关键词
Autonomous underwater vehicle (AUV); partially observable Markov decision process (POMDP); reinforcement learning (RL); soft actor-critic (SAC); underwater wireless optical communication (UWOC);
D O I
10.1109/JOE.2022.3165805
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
With the developments in underwater wireless optical communication (UWOC) technology, UWOC can be used in conjunction with autonomous underwater vehicles (AUVs) for high-speed data sharing among the vehicle formation during underwater exploration. A beam alignment problem arises during communication due to the transmission range, external disturbances and noise, and uncertainties in the AUV dynamic model. In this article, we propose an acoustic navigation method to guide the alignment process without requiring beam directors, light intensity sensors, and/or scanning algorithms as used in previous research. The AUVs need stably maintain a specific relative position and orientation for establishing an optical link. We model the alignment problem as a partially observable Markov decision process (POMDP) that takes manipulation, navigation, and energy consumption of underwater vehicles into account. However, finding an efficient policy for the POMDP under high partial observability and environmental variability is challenging. Therefore, for successful policy optimization, we utilize the soft actor-critic reinforcement learning algorithm together with AUV-specific belief updates and reward shaping based curriculum learning. Our approach outperformed baseline approaches in a simulation environment and successfully performed the beam alignment process from one AUV to another on the real AUV Tri-TON 2.
引用
收藏
页码:1231 / 1245
页数:15
相关论文
共 50 条
  • [1] Effect of Transporter Autonomous Underwater Vehicles for Underwater Optical Wireless Communication Considering Delay Tolerant Networks
    Matsuo, Keita
    Kulla, Elis
    Barolli, Leonard
    [J]. ADVANCES IN NETWORK-BASED INFORMATION SYSTEMS, NBIS-2022, 2022, 526 : 172 - 181
  • [2] OPTIMIZATION OF A SWARM OF AUTONOMOUS UNDERWATER VEHICLES FOR HIGH -BANDWIDTH UNDERWATER WIRELESS COMMUNICATION
    Immas, Alexandre
    Alam, Mohammad-Reza
    [J]. PROCEEDINGS OF THE ASME 38TH INTERNATIONAL CONFERENCE ON OCEAN, OFFSHORE AND ARCTIC ENGINEERING, 2019, VOL 7B, 2019,
  • [3] Multidimensional Beam Optimization in Underwater Optical Wireless Communication Based on Deep Reinforcement Learning
    Shin, Huicheol
    Baek, Seungjae
    Song, Yujae
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (17): : 28623 - 28634
  • [4] HIGH-BANDWIDTH UNDERWATER WIRELESS COMMUNICATION USING A SWARM OF AUTONOMOUS UNDERWATER VEHICLES
    Immas, Alexandre
    Saadat, Mohsen
    Navarro, Jesus
    Drake, Matthew
    Shen, Julie
    Alam, Mohammad-Reza
    [J]. PROCEEDINGS OF THE ASME 38TH INTERNATIONAL CONFERENCE ON OCEAN, OFFSHORE AND ARCTIC ENGINEERING, 2019, VOL 7B, 2019,
  • [5] Reinforcement learning: The application to autonomous biomimetic underwater vehicles control
    Magalhaes, J.
    Damas, B.
    Lobo, V.
    [J]. 4TH INTERNATIONAL SCIENTIFIC CONFERENCE SEA-CONF 2018, 2018, 172
  • [6] A behavior-based scheme using reinforcement learning for autonomous underwater vehicles
    Carreras, M
    Yuh, J
    Batlle, J
    Ridao, P
    [J]. IEEE JOURNAL OF OCEANIC ENGINEERING, 2005, 30 (02) : 416 - 427
  • [7] Fuzzy Rule-Based Signal Restoration for Wireless Optical Communication of Multi-autonomous Underwater Vehicles
    Yu Hao
    Yuzong Gu
    Yanbo Zhang
    [J]. International Journal of Fuzzy Systems, 2021, 23 : 1840 - 1848
  • [8] Fuzzy Rule-Based Signal Restoration for Wireless Optical Communication of Multi-autonomous Underwater Vehicles
    Hao, Yu
    Gu, Yuzong
    Zhang, Yanbo
    [J]. INTERNATIONAL JOURNAL OF FUZZY SYSTEMS, 2021, 23 (06) : 1840 - 1848
  • [9] Reinforcement Learning-Based Formation Control of Autonomous Underwater Vehicles with Model Interferences
    Cao, Wenqiang
    Yan, Jing
    Yang, Xian
    Luo, Xiaoyuan
    [J]. 2021 PROCEEDINGS OF THE 40TH CHINESE CONTROL CONFERENCE (CCC), 2021, : 4020 - 4025
  • [10] Scalable laser-based underwater wireless optical communication solution between autonomous underwater vehicle fleets
    Weng, Yang
    Sekimori, Yuki
    Chun, Sehwa
    Alkhazragi, Omar
    Matsuda, Takumi
    Trichili, Abderrahmen
    Ng, Tien Khee
    Ooi, Boon S.
    Maki, Toshihiro
    [J]. APPLIED OPTICS, 2023, 62 (31) : 8261 - 8271