Harnessing Online Knowledge Transfer for Enhanced Search and Rescue Decisions via Multi-Agent Reinforcement Learning

被引:0
|
作者
Song, Luona [1 ]
Wen, Zhigang [2 ]
Teng, Junjie [2 ]
Zhang, Jian [1 ]
Nicolas, Merveille [3 ]
机构
[1] Beijing Informat Sci & Technol Univ, Sch Econ & Management, Beijing 100192, Peoples R China
[2] Beijing Univ Posts & Telecommun, Sch Elect Engn, Beijing 100876, Peoples R China
[3] Univ Quebec Montreal, Dept Strategy & Social & Environm Responsibil, Montreal, PQ H3C 3P8, Canada
关键词
search and rescue (SAR); Internet of Things (IoT); deep deterministic policy gradient (DDPG); online knowledge transfer; soft target generation technique; cooperative games; competitive games;
D O I
10.3390/su152416741
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
In the rapidly evolving domain of the Internet of Things (IoT), devices play an instrumental role in high-stakes scenarios like search and rescue (SAR) operations. Traditional decision-making processes within SAR missions often struggle to cope with the dynamic and unpredictable nature of such environments, leading to inefficiencies and delayed responses. This paper aims to explore the potential of multi-agent reinforcement learning (MARL) to improve the decision-making process within SAR operations underpinned by IoT. Functional, current methods are limited by their static decision frameworks and inability to adapt in real time to the chaotic variables present in SAR situations. We introduced a novel MARL framework and compared its performance against benchmark strategies, specifically the multi-agent deep deterministic policy gradient (MADDPG) approach. Uniquely enhanced by online knowledge transfer, the framework leverages the capabilities of the deep deterministic policy gradient (DDPG) method. The preliminary findings underscore the proposed framework's superior efficiency and speed in SAR contexts. Our research highlights MARL's transformative potential, positing it as a groundbreaking strategy for IoT-based decision making in high-pressure SAR environments with suggestions for further studies in varied real-world scenarios.
引用
收藏
页数:18
相关论文
共 50 条
  • [31] Improving Multi-agent Reinforcement Learning with Imperfect Human Knowledge
    Han, Xiaoxu
    Tang, Hongyao
    Li, Yuan
    Kou, Guang
    Liu, Leilei
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2020, PT II, 2020, 12397 : 369 - 380
  • [32] A configuration of multi-agent reinforcement learning integrating prior knowledge
    Tang, Hainan
    Tang, Hongjie
    Liu, Juntao
    Rao, Ziyun
    Zhang, Yunshu
    Luo, Xunhao
    2024 2ND ASIA CONFERENCE ON COMPUTER VISION, IMAGE PROCESSING AND PATTERN RECOGNITION, CVIPPR 2024, 2024,
  • [33] Knowledge Reuse of Multi-Agent Reinforcement Learning in Cooperative Tasks
    Shi, Daming
    Tong, Junbo
    Liu, Yi
    Fan, Wenhui
    ENTROPY, 2022, 24 (04)
  • [34] Multi-Agent Navigation with Reinforcement Learning Enhanced Information Seeking
    Zhang, Siwei
    Guerra, Anna
    Guidi, Francesco
    Dardari, Davide
    Djuric, Petar M.
    2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022), 2022, : 982 - 986
  • [35] Knowledge graph-enhanced multi-agent reinforcement learning for adaptive scheduling in smart manufacturing
    Qin, Zhaojun
    Lu, Yuqian
    JOURNAL OF INTELLIGENT MANUFACTURING, 2024,
  • [36] Multi-Agent Systems for Search and Rescue Applications
    Daniel S. Drew
    Current Robotics Reports, 2021, 2 (2): : 189 - 200
  • [37] Output synchronization of multi-agent systems via reinforcement learning
    Liu, Yingying
    Wang, Zhanshan
    NEUROCOMPUTING, 2022, 508 : 110 - 119
  • [38] Network Maintenance Planning Via Multi-Agent Reinforcement Learning
    Thomas, Jonathan
    Hernandez, Marco Perez
    Parlikad, Ajith Kumar
    Piechocki, Robert
    2021 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2021, : 2289 - 2295
  • [39] IntelligentCrowd: Mobile Crowdsensing via Multi-Agent Reinforcement Learning
    Chen, Yize
    Wang, Hao
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2021, 5 (05): : 840 - 845
  • [40] Safe Multi-Agent Reinforcement Learning via Dynamic Shielding
    Qiu, Yunbo
    Jin, Yue
    Yu, Lebin
    Wang, Jian
    Zhang, Xudong
    2024 IEEE CONFERENCE ON ARTIFICIAL INTELLIGENCE, CAI 2024, 2024, : 1254 - 1257