Caching Transient Content for IoT Sensing: Multi-Agent Soft Actor-Critic

Cited by: 34

Authors:

Wu, Xiongwei ^{[1]}

Li, Xiuhua ^{[2,3]}

Li, Jun ^{[4]}

Ching, P. C. ^{[1]}

Leung, Victor C. M. ^{[5,6]}

Poor, H. Vincent ^{[7]}

Affiliations:

[1] Chinese Univ Hong Kong, Dept Elect Engn, Hong Kong, Peoples R China

[2] Chongqing Univ, Sch Big Data & Software Engn, Chongqing 401331, Peoples R China

[3] Chongqing Univ, Minist Educ, Key Lab Dependable Serv Comp Cyber Phys Soc, Chongqing 401331, Peoples R China

[4] Nanjing Univ Sci & Technol, Sch Elect & Opt Engn, Nanjing 210094, Peoples R China

[5] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen 518060, Peoples R China

[6] Univ British Columbia, Dept Elect & Comp Engn, Vancouver, BC V6T 1Z4, Canada

[7] Princeton Univ, Dept Elect & Comp Engn, Princeton, NJ 08544 USA

Source:

IEEE TRANSACTIONS ON COMMUNICATIONS | 2021 / Vol. 69 / No. 09

Funding:

U.S. National Science Foundation; Natural Sciences and Engineering Research Council of Canada

Keywords:

Sensors; Wireless sensor networks; Energy consumption; Transient analysis; Sensor phenomena and characterization; Internet of Things; Intelligent sensors; age of information; cooperative multi-agent Markov decision process; soft actor-critic; NETWORKS; INFORMATION; AGE; INTERNET

DOI:

10.1109/TCOMM.2021.3086535

Chinese Library Classification (CLC) number:

TM [Electrical engineering]; TN [Electronics and communication technology]

Discipline classification codes:

0808; 0809

Abstract:

Edge nodes (ENs) in the Internet of Things commonly serve as gateways that cache sensing data while providing access services for data consumers. This paper considers multiple ENs that cache sensing data under the coordination of the cloud. In particular, each EN can fetch content generated by sensors within its coverage, which can be uploaded to the cloud via fronthaul and then delivered to other ENs beyond the communication range. However, sensing data are usually transient, and frequent cache updates can lead to considerable energy consumption at sensors and heavy fronthaul traffic loads. Therefore, we adopt Age of Information to evaluate data freshness and investigate intelligent caching policies that preserve data freshness while reducing cache update costs. Specifically, we model the cache update problem as a cooperative multi-agent Markov decision process with the goal of minimizing the long-term average weighted cost. To efficiently handle the exponentially large number of actions, we devise a novel reinforcement learning approach, which is a discrete multi-agent variant of soft actor-critic (SAC). Furthermore, we extend the proposed approach to decentralized control, where each EN makes decisions based on local observations only. Simulation results demonstrate the superior performance of the proposed SAC-based caching schemes.
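
As a rough illustration of the trade-off the abstract describes, the Python sketch below tracks the Age of Information of cached items and computes an average weighted cost over a few time slots. It is not the paper's model: the function names (`step_ages`, `weighted_cost`), the rule that a refreshed item restarts at age 1, the age cap `max_age`, and the cost form (mean age plus `weight` times a per-update cost) are all assumptions chosen for illustration.

```python
from typing import List


def step_ages(ages: List[int], update: List[bool], max_age: int) -> List[int]:
    """Advance one slot (assumed rule): refreshed items restart at age 1,
    all other items grow one slot older, capped at max_age."""
    return [1 if u else min(a + 1, max_age) for a, u in zip(ages, update)]


def weighted_cost(ages: List[int], update: List[bool],
                  update_cost: float, weight: float) -> float:
    """Hypothetical per-slot cost: mean age (staleness) plus the
    weighted cost of every cache update performed in this slot."""
    mean_age = sum(ages) / len(ages)
    return mean_age + weight * update_cost * sum(update)


# Three cached items, all fresh at the start (illustrative values only).
ages = [1, 1, 1]
# One update decision per item per slot; True means "refresh this item".
schedule = [
    [False, False, False],
    [True, False, False],
    [False, True, False],
]

total = 0.0
for update in schedule:
    ages = step_ages(ages, update, max_age=10)
    total += weighted_cost(ages, update, update_cost=1.0, weight=0.5)

# Average weighted cost over the simulated slots.
average_cost = total / len(schedule)
print(ages, round(average_cost, 4))
```

Updating more often lowers the age term but raises the update term, which is the tension a caching policy has to balance. A learned policy would choose the `update` vector in each slot instead of following a fixed schedule.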


Pages: 5886-5901

Page count: 16
