Smart Underwater Pollution Detection Based on Graph-Based Multi-Agent Reinforcement Learning Towards AUV-Based Network ITS

被引:28
|
作者
Lin, Chuan [1 ]
Han, Guangjie [2 ,3 ]
Zhang, Tongwei [4 ]
Shah, Syed Bilal Hussain [5 ]
Peng, Yan [6 ]
机构
[1] Northeastern Univ, Software Coll, Shenyang 110819, Peoples R China
[2] Hohai Univ, Dept Internet Things Engn, Changzhou 213022, Peoples R China
[3] Chinese Acad Sci, Inst Acoust, State Key Lab Acoust, Beijing 100190, Peoples R China
[4] Natl Deep Sea Ctr, Dept Technol, Qingdao 266237, Peoples R China
[5] Manchester Metropolitan Univ, Sch Comp & Math, Manchester M15 6BH, Lancs, England
[6] Shanghai Univ, Artificial Intelligence Inst, Shanghai 200444, Peoples R China
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
Pollution; Task analysis; Optimization; Systems architecture; Heuristic algorithms; Computer architecture; Routing; Autonomous underwater vehicle; software-defined networking; 6G; graph-based soft actor critic; intelligent transportation systems; EFFICIENT DATA-COLLECTION; ENERGY-EFFICIENT; INTERNET; SCHEME; ARCHITECTURE;
D O I
10.1109/TITS.2022.3162850
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
The exploitation/utilization of marine resources and the rapid development of urbanization along coastal cities result in serious marine pollution, especially underwater diffusion pollution. It is a non-trivial task to detect the source of diffusion pollution, such that the disadvantageous effect of the pollution can be reduced. With the vision of 6G framework, we employ Autonomous Underwater Vehicle (AUV) flock and introduce the concept of AUV-based network. In particular, we utilize the Software-Defined Networking (SDN) technique to update the controllability of the AUV-based network, leading to the paradigm of SDN-enabled multi-AUVs network Intelligent Transportation Systems (SDNA-ITS). For SDNA-ITS, we utilize artificial potential field theories to model the control model. To optimize the system output, we introduce the graph-based Soft Actor-Critic (SAC) algorithm, i.e., a category of Multi-Agent Reinforcement Learning (MARL) mechanism where each AUV can be regarded as a node in a graph. In particular, we improve the optimization model based on Centralized Training Decentralized Execution (CTDE) architecture with the assistance of the SDN controller, by which each AUV can efficiently adjust its speed towards the diffusion source. Further, to achieve exact path planning for detecting the diffusion source, a dynamic detection scheme is proposed to output the united control policy to schedule the SDNA-ITS dynamically. Simulation results demonstrate that our approaches are available to detect the underwater diffusion source when the actual scenario is taken into account and perform better than some recent research products.
引用
收藏
页码:7494 / 7505
页数:12
相关论文
共 50 条
  • [41] Multi-agent reinforcement learning based on local communication
    Wenxu Zhang
    Lei Ma
    Xiaonan Li
    Cluster Computing, 2019, 22 : 15357 - 15366
  • [42] Cooperative multi-agent game based on reinforcement learning
    Liu, Hongbo
    HIGH-CONFIDENCE COMPUTING, 2024, 4 (01):
  • [43] Survey of Multi-Agent Strategy Based on Reinforcement Learning
    Chen, Liang
    Guo, Ting
    Liu, Yun-ting
    Yang, Jia-ming
    PROCEEDINGS OF THE 32ND 2020 CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2020), 2020, : 604 - 609
  • [44] Multi-agent cooperative learning research based on reinforcement learning
    Liu, Fei
    Zeng, Guangzhou
    2006 10TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN, PROCEEDINGS, VOLS 1 AND 2, 2006, : 1408 - 1413
  • [45] Graph Multi-Agent Reinforcement Learning for Inverter-Based Active Voltage Control
    Mu, Chaoxu
    Liu, Zhaoyang
    Yan, Jun
    Jia, Hongjie
    Zhang, Xiaoyu
    IEEE TRANSACTIONS ON SMART GRID, 2024, 15 (02) : 1399 - 1409
  • [46] Path Reasoning over Knowledge Graph: A Multi-Agent and Reinforcement Learning Based Method
    Li, Zixuan
    Jin, Xiaolong
    Guan, Saiping
    Wang, Yuanzhuo
    Cheng, Xueqi
    2018 18TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW), 2018, : 929 - 936
  • [47] Satellite Network Traffic Scheduling Algorithm Based on Multi-Agent Reinforcement Learning
    Zhang, Tingting
    Zhang, Mingqi
    Yang, Lintao
    Dong, Tao
    Yin, Jie
    Liu, Zhihui
    Wu, Jing
    Jiang, Hao
    19TH IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING WITH APPLICATIONS (ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM 2021), 2021, : 761 - 768
  • [48] A graph neural network based deep reinforcement learning algorithm for multi-agent leader-follower flocking
    Xiao, Jian
    Wang, Zhuoran
    He, Jinhui
    Yuan, Guohui
    INFORMATION SCIENCES, 2023, 641
  • [49] Partially Observable Mean Field Multi-Agent Reinforcement Learning Based on Graph Attention Network for UAV Swarms
    Yang, Min
    Liu, Guanjun
    Zhou, Ziyuan
    Wang, Jiacun
    DRONES, 2023, 7 (07)
  • [50] SMART: Situationally-Aware Multi-Agent Reinforcement Learning-Based Transmissions
    Jiang, Zhiyuan
    Liu, Yan
    Hribar, Jernej
    DaSilva, Luiz A.
    Zhou, Sheng
    Niu, Zhisheng
    IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING, 2021, 7 (04) : 1430 - 1443