Smart Underwater Pollution Detection Based on Graph-Based Multi-Agent Reinforcement Learning Towards AUV-Based Network ITS

被引:28
|
作者
Lin, Chuan [1 ]
Han, Guangjie [2 ,3 ]
Zhang, Tongwei [4 ]
Shah, Syed Bilal Hussain [5 ]
Peng, Yan [6 ]
机构
[1] Northeastern Univ, Software Coll, Shenyang 110819, Peoples R China
[2] Hohai Univ, Dept Internet Things Engn, Changzhou 213022, Peoples R China
[3] Chinese Acad Sci, Inst Acoust, State Key Lab Acoust, Beijing 100190, Peoples R China
[4] Natl Deep Sea Ctr, Dept Technol, Qingdao 266237, Peoples R China
[5] Manchester Metropolitan Univ, Sch Comp & Math, Manchester M15 6BH, Lancs, England
[6] Shanghai Univ, Artificial Intelligence Inst, Shanghai 200444, Peoples R China
基金
中国博士后科学基金; 中国国家自然科学基金;
关键词
Pollution; Task analysis; Optimization; Systems architecture; Heuristic algorithms; Computer architecture; Routing; Autonomous underwater vehicle; software-defined networking; 6G; graph-based soft actor critic; intelligent transportation systems; EFFICIENT DATA-COLLECTION; ENERGY-EFFICIENT; INTERNET; SCHEME; ARCHITECTURE;
D O I
10.1109/TITS.2022.3162850
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
The exploitation/utilization of marine resources and the rapid development of urbanization along coastal cities result in serious marine pollution, especially underwater diffusion pollution. It is a non-trivial task to detect the source of diffusion pollution, such that the disadvantageous effect of the pollution can be reduced. With the vision of 6G framework, we employ Autonomous Underwater Vehicle (AUV) flock and introduce the concept of AUV-based network. In particular, we utilize the Software-Defined Networking (SDN) technique to update the controllability of the AUV-based network, leading to the paradigm of SDN-enabled multi-AUVs network Intelligent Transportation Systems (SDNA-ITS). For SDNA-ITS, we utilize artificial potential field theories to model the control model. To optimize the system output, we introduce the graph-based Soft Actor-Critic (SAC) algorithm, i.e., a category of Multi-Agent Reinforcement Learning (MARL) mechanism where each AUV can be regarded as a node in a graph. In particular, we improve the optimization model based on Centralized Training Decentralized Execution (CTDE) architecture with the assistance of the SDN controller, by which each AUV can efficiently adjust its speed towards the diffusion source. Further, to achieve exact path planning for detecting the diffusion source, a dynamic detection scheme is proposed to output the united control policy to schedule the SDNA-ITS dynamically. Simulation results demonstrate that our approaches are available to detect the underwater diffusion source when the actual scenario is taken into account and perform better than some recent research products.
引用
收藏
页码:7494 / 7505
页数:12
相关论文
共 50 条
  • [31] Conditional Synchronization in Multi-Agent Graph-based Knowledge System
    Kotulski, Leszek
    Sedziwy, Adam
    Strug, Barbara
    INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE, ICCS 2015 COMPUTATIONAL SCIENCE AT THE GATES OF NATURE, 2015, 51 : 1043 - 1051
  • [32] Towards effectively feature graph-based IoT botnet detection via reinforcement learning
    Quoc-Dung Ngo
    Huy-Trung Nguyen
    Le-Cuong Nguyen
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 41 (06) : 6801 - 6814
  • [33] A Multi-agent Reinforcement Learning-Based Transmission Protocol for Underwater Acoustic Networks
    Gao, Yu
    Bi, Zhicheng
    Wang, Chaofeng
    17TH ACM INTERNATIONAL CONFERENCE ON UNDERWATER NETWORKS & SYSTEMS, WUWNET 2023, 2024,
  • [34] GMIX: Graph-based spatial-temporal multi-agent reinforcement learning for dynamic electric vehicle dispatching system
    Zhou, Tao
    Kris, M. Y. Law
    Creighton, Douglas
    Wu, Changzhi
    TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2022, 144
  • [35] Resources allocation for underwater acoustic soft frequency reuse network based on multi-agent deep reinforcement learning
    Zhang, Yuzhi
    Li, Mengfan
    Feng, Xiaomei
    Han, Xiang
    Jia, Menglei
    PHYSICAL COMMUNICATION, 2024, 67
  • [36] Graph Attention Network-Based Multi-Agent Reinforcement Learning for Slicing Resource Management in Dense Cellular Network
    Shao, Yan
    Li, Rongpeng
    Hu, Bing
    Wu, Yingxiao
    Zhao, Zhifeng
    Zhang, Honggang
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2021, 70 (10) : 10792 - 10803
  • [37] Multi-agent reinforcement learning based on local communication
    Zhang, Wenxu
    Ma, Lei
    Li, Xiaonan
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2019, 22 (Suppl 6): : 15357 - 15366
  • [38] Multi-agent Cooperative Search based on Reinforcement Learning
    Sun, Yinjiang
    Zhang, Rui
    Liang, Wenbao
    Xu, Cheng
    PROCEEDINGS OF 2020 3RD INTERNATIONAL CONFERENCE ON UNMANNED SYSTEMS (ICUS), 2020, : 891 - 896
  • [39] Hierarchical Multi-Agent Training Based on Reinforcement Learning
    Wang, Guanghua
    Li, Wenjie
    Wu, Zhanghua
    Guo, Xian
    2024 9TH ASIA-PACIFIC CONFERENCE ON INTELLIGENT ROBOT SYSTEMS, ACIRS, 2024, : 11 - 18
  • [40] Function approximation based multi-agent reinforcement learning
    Abul, O
    Polat, F
    Alhajj, R
    12TH IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2000, : 36 - 39