Decentralized Multi-Agent Bandit Learning for Intelligent Internet of Things Systems

被引:0
|
作者
Leng, Qiuyu [1 ,2 ,3 ]
Wang, Shangshang [1 ]
Huang, Xi [4 ]
Shao, Ziyu [1 ]
Yang, Yang [1 ]
机构
[1] ShanghaiTech Univ, Sch Informat Sci & Technol, Shanghai, Peoples R China
[2] Chinese Acad Sci, Shanghai Inst Microsyst & Informat Technol, Shanghai, Peoples R China
[3] Univ Chinese Acad Sci, Beijing, Peoples R China
[4] Shenzhen Inst Artificial Intelligence & Robot Soc, Shenzhen, Peoples R China
关键词
Intelligent Internet of Things systems; data heterogeneity; multi-agent bandit learning; IOT;
D O I
10.1109/WCNC51071.2022.9771884
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In intelligent Internet of Things systems, data-hungry services are empowered by data collection, which is jointly accomplished by edge servers and data-collecting sensors. In this paper, we aim to achieve efficient data collection, i.e., maximize data rates from sensors to servers while mitigating the impact of data heterogeneity for data collected from sensors. Considering geographically distributed servers and sensors, we study the problem from the perspective of multi-agent multi-armed bandits. The key ideas of our approach are to 1) establish associations between servers and sensors under unknown wireless dynamics (i.e., channel state information) and selection fraction constraints; 2) utilize shared information via pairwise communication between servers to mitigate biased observations for data rates. To this end, we propose a scheme that leverages online learning to reduce uncertainties in wireless dynamics and online control to mitigate the impact of data heterogeneity. Based on an effective integration of bandit learning methods under pairwise communication and Lyapunov optimization techniques, we present a novel Decentralized sErver-Sensor association scheme with Multi-Agent learning under pairwise communication (DESMA). Our theoretical analysis demonstrates that DESMA achieves a tunable trade-off between maximizing data rate and mitigating the impact of data heterogeneity.
引用
收藏
页码:2118 / 2123
页数:6
相关论文
共 50 条
  • [31] Multi-agent Reinforcement Learning for Decentralized Stable Matching
    Taywade, Kshitija
    Goldsmith, Judy
    Harrison, Brent
    [J]. ALGORITHMIC DECISION THEORY, ADT 2021, 2021, 13023 : 375 - 389
  • [32] Multi-agent reinforcement learning as a rehearsal for decentralized planning
    Kraemer, Landon
    Banerjee, Bikramjit
    [J]. NEUROCOMPUTING, 2016, 190 : 82 - 94
  • [33] Decentralized and Asymmetric Multi-Agent Learning in Construction Sites
    Miron, Yakov
    Navon, Dan
    Goldfracht, Yuval
    Castro, Dotan Di
    Klein, Itzik
    [J]. IEEE Open Journal of Vehicular Technology, 2024, 5 : 1587 - 1599
  • [34] Decentralized Multi-agent Reinforcement Learning with Shared Actions
    Mishra, Rajesh K.
    Vasal, Deepanshu
    Vishwanath, Sriram
    [J]. 2021 55TH ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS (CISS), 2021,
  • [35] Mobile Multi-Agent Systems for the Internet-of-Things and Clouds using the Java']JavaScript Agent Machine Platform and Machine Learning as a Service
    Bosse, Stefan
    [J]. 2016 IEEE 4TH INTERNATIONAL CONFERENCE ON FUTURE INTERNET OF THINGS AND CLOUD (FICLOUD 2016), 2016, : 246 - 255
  • [36] Multi-agent learning within an Internet environment
    Tillotson, PRJ
    Wu, QH
    Hughes, PM
    [J]. INTELLIGENT CONTROL SYSTEMS AND SIGNAL PROCESSING 2003, 2003, : 117 - 122
  • [37] ADP-Based Intelligent Decentralized Control for Multi-Agent Systems Moving in Obstacle Environment
    Lan, Xuejing
    Liu, Lei
    Wang, Yongji
    [J]. IEEE ACCESS, 2019, 7 : 59624 - 59630
  • [38] Semantic reliability of multi-agent intelligent systems
    Sundresh, Tippure S.
    [J]. BELL LABS TECHNICAL JOURNAL, 2006, 11 (03) : 225 - 236
  • [39] MASA: Multi-agent Subjectivity Alignment for Trustworthy Internet of Things
    Zeynalvand, Leonit
    Zhang, Jie
    Luo, Tony T.
    Chen, Shuo
    [J]. 2018 21ST INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION), 2018, : 2013 - 2020
  • [40] Intelligent Multi-agent based Convergence Systems
    Cho, Young Im
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-3, 2009, : 2136 - 2141