Harnessing Online Knowledge Transfer for Enhanced Search and Rescue Decisions via Multi-Agent Reinforcement Learning

被引:0
|
作者
Song, Luona [1 ]
Wen, Zhigang [2 ]
Teng, Junjie [2 ]
Zhang, Jian [1 ]
Nicolas, Merveille [3 ]
机构
[1] Beijing Informat Sci & Technol Univ, Sch Econ & Management, Beijing 100192, Peoples R China
[2] Beijing Univ Posts & Telecommun, Sch Elect Engn, Beijing 100876, Peoples R China
[3] Univ Quebec Montreal, Dept Strategy & Social & Environm Responsibil, Montreal, PQ H3C 3P8, Canada
关键词
search and rescue (SAR); Internet of Things (IoT); deep deterministic policy gradient (DDPG); online knowledge transfer; soft target generation technique; cooperative games; competitive games;
D O I
10.3390/su152416741
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
In the rapidly evolving domain of the Internet of Things (IoT), devices play an instrumental role in high-stakes scenarios like search and rescue (SAR) operations. Traditional decision-making processes within SAR missions often struggle to cope with the dynamic and unpredictable nature of such environments, leading to inefficiencies and delayed responses. This paper aims to explore the potential of multi-agent reinforcement learning (MARL) to improve the decision-making process within SAR operations underpinned by IoT. Functional, current methods are limited by their static decision frameworks and inability to adapt in real time to the chaotic variables present in SAR situations. We introduced a novel MARL framework and compared its performance against benchmark strategies, specifically the multi-agent deep deterministic policy gradient (MADDPG) approach. Uniquely enhanced by online knowledge transfer, the framework leverages the capabilities of the deep deterministic policy gradient (DDPG) method. The preliminary findings underscore the proposed framework's superior efficiency and speed in SAR contexts. Our research highlights MARL's transformative potential, positing it as a groundbreaking strategy for IoT-based decision making in high-pressure SAR environments with suggestions for further studies in varied real-world scenarios.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] AdverSAR: Adversarial Search and Rescue via Multi-Agent Reinforcement Learning
    Rahman, Aowabin
    Bhattacharya, Arnab
    Ramachandran, Thiagarajan
    Mukherjee, Sayak
    Sharma, Himanshu
    Fujimoto, Ted
    Chatterjee, Samrat
    2022 IEEE INTERNATIONAL SYMPOSIUM ON TECHNOLOGIES FOR HOMELAND SECURITY (HST), 2022,
  • [2] Multi-agent reinforcement learning via knowledge transfer with differentially private noise
    Cheng, Zishuo
    Ye, Dayong
    Zhu, Tianqing
    Zhou, Wanlei
    Yu, Philip S.
    INTERNATIONAL JOURNAL OF INTELLIGENT SYSTEMS, 2022, 37 (01) : 799 - 828
  • [3] Knowledge Transfer in Multi-Objective Multi-Agent Reinforcement Learning via Generalized Policy Improvement
    de Almeida, Vicente N.
    Alegre, Lucas N.
    Bazzan, Ana L. C.
    COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2024, 21 (01) : 335 - 362
  • [4] KnowRU: Knowledge Reuse via Knowledge Distillation in Multi-Agent Reinforcement Learning
    Gao, Zijian
    Xu, Kele
    Ding, Bo
    Wang, Huaimin
    ENTROPY, 2021, 23 (08)
  • [5] Knowledge transfer in multi-agent reinforcement learning with incremental number of agents
    Liu Wenzhang
    Dong Lu
    Liu Jian
    Sun Changyin
    JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2022, 33 (02) : 447 - 460
  • [6] Knowledge transfer in multi-agent reinforcement learning with incremental number of agents
    LIU Wenzhang
    DONG Lu
    LIU Jian
    SUN Changyin
    JournalofSystemsEngineeringandElectronics, 2022, 33 (02) : 447 - 460
  • [7] Multi-Agent Common Knowledge Reinforcement Learning
    de Witt, Christian A. Schroeder
    Foerster, Jakob N.
    Farquhar, Gregory
    Torr, Philip H. S.
    Boehmer, Wendelin
    Whiteson, Shimon
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [8] Aggregation Transfer Learning for Multi-Agent Reinforcement learning
    Xu, Dongsheng
    Qiao, Peng
    Dou, Yong
    2021 2ND INTERNATIONAL CONFERENCE ON BIG DATA & ARTIFICIAL INTELLIGENCE & SOFTWARE ENGINEERING (ICBASE 2021), 2021, : 547 - 551
  • [9] Hierarchical reinforcement learning via dynamic subspace search for multi-agent planning
    Ma, Aaron
    Ouimet, Michael
    Cortes, Jorge
    AUTONOMOUS ROBOTS, 2020, 44 (3-4) : 485 - 503
  • [10] Hierarchical reinforcement learning via dynamic subspace search for multi-agent planning
    Aaron Ma
    Michael Ouimet
    Jorge Cortés
    Autonomous Robots, 2020, 44 : 485 - 503