Harnessing Online Knowledge Transfer for Enhanced Search and Rescue Decisions via Multi-Agent Reinforcement Learning

被引:0
|
作者
Song, Luona [1 ]
Wen, Zhigang [2 ]
Teng, Junjie [2 ]
Zhang, Jian [1 ]
Nicolas, Merveille [3 ]
机构
[1] Beijing Informat Sci & Technol Univ, Sch Econ & Management, Beijing 100192, Peoples R China
[2] Beijing Univ Posts & Telecommun, Sch Elect Engn, Beijing 100876, Peoples R China
[3] Univ Quebec Montreal, Dept Strategy & Social & Environm Responsibil, Montreal, PQ H3C 3P8, Canada
关键词
search and rescue (SAR); Internet of Things (IoT); deep deterministic policy gradient (DDPG); online knowledge transfer; soft target generation technique; cooperative games; competitive games;
D O I
10.3390/su152416741
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
In the rapidly evolving domain of the Internet of Things (IoT), devices play an instrumental role in high-stakes scenarios like search and rescue (SAR) operations. Traditional decision-making processes within SAR missions often struggle to cope with the dynamic and unpredictable nature of such environments, leading to inefficiencies and delayed responses. This paper aims to explore the potential of multi-agent reinforcement learning (MARL) to improve the decision-making process within SAR operations underpinned by IoT. Functional, current methods are limited by their static decision frameworks and inability to adapt in real time to the chaotic variables present in SAR situations. We introduced a novel MARL framework and compared its performance against benchmark strategies, specifically the multi-agent deep deterministic policy gradient (MADDPG) approach. Uniquely enhanced by online knowledge transfer, the framework leverages the capabilities of the deep deterministic policy gradient (DDPG) method. The preliminary findings underscore the proposed framework's superior efficiency and speed in SAR contexts. Our research highlights MARL's transformative potential, positing it as a groundbreaking strategy for IoT-based decision making in high-pressure SAR environments with suggestions for further studies in varied real-world scenarios.
引用
收藏
页数:18
相关论文
共 50 条
  • [41] PRIMAL: Pathfinding via Reinforcement and Imitation Multi-Agent Learning
    Sartoretti, Guillaume
    Kerr, Justin
    Shi, YunFei
    Wagner, Glenn
    Kumar, T. K. Satish
    Koenig, Sven
    Choset, Howie
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2019, 4 (03): : 2378 - 2385
  • [42] MULTI-AGENT REINFORCEMENT LEARNING WITH CONTRIBUTIONBASED ASSIGNMENT ONLINE ROUTING IN SDN
    Yue Xiaofeng
    Wu Lijun
    Duan Weiwei
    2022 19TH INTERNATIONAL COMPUTER CONFERENCE ON WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING (ICCWAMTIP), 2022,
  • [43] Online Reinforcement Learning in Multi-Agent Systems for Distributed Energy Systems
    Menon, Bharat R.
    Menon, Sangeetha B.
    Srinivasan, Dipti
    Jain, Lakhmi
    2014 IEEE INNOVATIVE SMART GRID TECHNOLOGIES - ASIA (ISGT ASIA), 2014, : 791 - 796
  • [44] Online optimization of traffic policy through multi-agent reinforcement learning
    Sasaki, Y
    Flann, NS
    PROCEEDINGS OF THE 7TH JOINT CONFERENCE ON INFORMATION SCIENCES, 2003, : 1211 - 1214
  • [45] Hierarchical Control of Multi-Agent Systems using Online Reinforcement Learning
    Bai, He
    George, Jemin
    Chakrabortty, Aranya
    2020 AMERICAN CONTROL CONFERENCE (ACC), 2020, : 340 - 345
  • [46] Online Multi-Agent Reinforcement Learning for Multiple Access in Wireless Networks
    Xiao, Jianbin
    Chen, Zhenyu
    Sun, Xinghua
    Zhan, Wen
    Wang, Xijun
    Chen, Xiang
    IEEE COMMUNICATIONS LETTERS, 2023, 27 (12) : 3250 - 3254
  • [47] Co-Evolving Multi-Agent Transfer Reinforcement Learning via Scenario Independent Representation
    Siddiqua, Ayesha
    Liu, Siming
    Nipu, Ayesha Siddika
    Harris, Anthony
    Liu, Yan
    IEEE ACCESS, 2024, 12 : 99439 - 99451
  • [48] Learning to Collaborate: Multi-Scenario Ranking via Multi-Agent Reinforcement Learning
    Feng, Jun
    Li, Heng
    Huang, Minlie
    Liu, Shichen
    Ou, Wenwu
    Wang, Zhirong
    Zhu, Xiaoyan
    WEB CONFERENCE 2018: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW2018), 2018, : 1939 - 1948
  • [49] Enhanced Cooperative Multi-agent Learning Algorithms (ECMLA) using Reinforcement Learning
    Vidhate, Deepak A.
    Kulkarni, Parag
    2016 INTERNATIONAL CONFERENCE ON COMPUTING, ANALYTICS AND SECURITY TRENDS (CAST), 2016, : 556 - 561
  • [50] Knowledge distillation for portfolio management using multi-agent reinforcement learning
    Chen, Min-You
    Chen, Chiao-Ting
    Huang, Szu-Hao
    ADVANCED ENGINEERING INFORMATICS, 2023, 57