Collaborative multi-agents in dynamic industrial internet of things using deep reinforcement learning

被引:3
|
作者
Raza, Ali [1 ]
Shah, Munam Ali [1 ]
Khattak, Hasan Ali [2 ]
Maple, Carsten [3 ]
Al-Turjman, Fadi [4 ]
Rauf, Hafiz Tayyab [5 ]
机构
[1] COMSATS Univ Islamabad, Dept Comp Sci, Islamabad 44000, Pakistan
[2] Natl Univ Sci & Technol NUST, Sch Elect Engn & Comp Sci, Islamabad 44500, Pakistan
[3] Univ Warwick, WMG, Secur Cyber Syst Res Grp, Coventry CV4 7AL, W Midlands, England
[4] Near East Univ, Res Ctr AI & IoT, Artificial Intelligence Dept, Mersin 10, Nicosia, Turkey
[5] Univ Bradford, Fac Engn & Informat, Dept Comp Sci, Bradford BD7 1AZ, W Yorkshire, England
基金
英国工程与自然科学研究理事会;
关键词
Deep reinforcement learning; Multi-agents; Behavior cloning; Dynamic environment; Scalability; OBSTACLE AVOIDANCE; ENVIRONMENT; NAVIGATION; SYSTEMS;
D O I
10.1007/s10668-021-01836-9
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Sustainable cities are envisioned to have economic and industrial steps toward reducing pollution. Many real-world applications such as autonomous vehicles, transportation, traffic signals, and industrial automation can now be trained using deep reinforcement learning (DRL) techniques. These applications are designed to take benefit of DRL in order to improve the monitoring as well as measurements in industrial internet of things for automation identification system. The complexity of these environments means that it is more appropriate to use multi-agent systems rather than a single-agent. However, in non-stationary environments multi-agent systems can suffer from increased number of observations, limiting the scalability of algorithms. This study proposes a model to tackle the problem of scalability in DRL algorithms in transportation domain. A partition-based approach is used in the proposed model to reduce the complexity of the environment. This partition-based approach helps agents to stay in their working area. This reduces the complexity of the learning environment and the number of observations for each agent. The proposed model uses generative adversarial imitation learning and behavior cloning, combined with a proximal policy optimization algorithm, for training multiple agents in a dynamic environment. We present a comparison of PPO, soft actor-critic, and our model in reward gathering. Our simulation results show that our model outperforms SAC and PPO in cumulative reward gathering and dramatically improved training multiple agents.
引用
收藏
页码:9481 / 9499
页数:19
相关论文
共 50 条
  • [31] Converging Game Theory and Reinforcement Learning For Industrial Internet of Things
    Ho, Tai Manh
    Nguyen, Kim Khoa
    Cheriet, Mohamed
    [J]. IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2023, 20 (02): : 890 - 903
  • [32] Game Theoretic Reinforcement Learning Framework For Industrial Internet of Things
    Tai Manh Ho
    Kim-Khoa Nguyen
    Cheriet, Mohamed
    [J]. 2022 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2022, : 2112 - 2117
  • [33] Towards Online Continuous Reinforcement Learning on Industrial Internet of Things
    Qian, Cheng
    Yu, Wei
    Liu, Xing
    Griffith, David
    Golmie, Nada
    [J]. 2021 IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTING, SCALABLE COMPUTING & COMMUNICATIONS, INTERNET OF PEOPLE, AND SMART CITY INNOVATIONS (SMARTWORLD/SCALCOM/UIC/ATC/IOP/SCI 2021), 2021, : 280 - 287
  • [34] Toward Deep Transfer Learning in Industrial Internet of Things
    Liu, Xing
    Yu, Wei
    Liang, Fan
    Griffith, David
    Golmie, Nada
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (15) : 12163 - 12175
  • [35] Ambient Lighting Controller Based on Reinforcement Learning Components of Multi-Agents
    Bielskis, A. A.
    Guseinoviene, E.
    Dzemydiene, D.
    Drungilas, D.
    Gricius, G.
    [J]. ELEKTRONIKA IR ELEKTROTECHNIKA, 2012, 121 (05) : 79 - 84
  • [36] Cooperative Localization for Multi-Agents Based on Reinforcement Learning Compensated Filter
    Wang, Ran
    Xu, Cheng
    Sun, Jing
    Duan, Shihong
    Zhang, Xiaotong
    [J]. IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2024, 42 (10) : 2820 - 2831
  • [37] Toward competitive multi-agents in Polo game based on reinforcement learning
    Zahra Movahedi
    Azam Bastanfard
    [J]. Multimedia Tools and Applications, 2021, 80 : 26773 - 26793
  • [38] Toward competitive multi-agents in Polo game based on reinforcement learning
    Movahedi, Zahra
    Bastanfard, Azam
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (17) : 26773 - 26793
  • [39] NOMA Assisted Multi-Task Multi-Access Mobile Edge Computing via Deep Reinforcement Learning for Industrial Internet of Things
    Qian, Liping
    Wu, Yuan
    Jiang, Fuli
    Yu, Ningning
    Lu, Weidang
    Lin, Bin
    [J]. IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, 17 (08) : 5688 - 5698
  • [40] Enabling security for the Industrial Internet of Things using deep learning, blockchain, and coalitions
    Sharma, Mehul
    Pant, Shrid
    Kumar Sharma, Deepak
    Datta Gupta, Koyel
    Vashishth, Vidushi
    Chhabra, Anshuman
    [J]. TRANSACTIONS ON EMERGING TELECOMMUNICATIONS TECHNOLOGIES, 2021, 32 (07)