UAV Assisted Cooperative Caching on Network Edge Using Multi-Agent Actor-Critic Reinforcement Learning

被引:12
|
作者
Araf, Sadman [1 ]
Saha, Adittya Soukarjya [1 ]
Kazi, Sadia Hamid [1 ]
Tran, Nguyen H. H. [2 ]
Alam, Md. Golam Rabiul [1 ]
机构
[1] Brac Univ, Dept Comp Sci & Engn, Dhaka 1212, Bangladesh
[2] Univ Sydney, Fac Engn, Sch Comp Sci, Sydney, NSW 2006, Australia
关键词
Base stations; Servers; Reinforcement learning; Cooperative caching; Vehicle dynamics; Computational modeling; Cloud computing; Cooperative edge caching; multi-acccess edge computing; multi-agent actor-critic; reinforcement learning; unmanned aerial vehicle (UAV); COMMUNICATION; MANAGEMENT;
D O I
10.1109/TVT.2022.3209079
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In recent times, caching at edge nodes is a well-known technique to overcome the limitation of strict latency, which simultaneously improves users' Quality of Experience (QoE). However, choosing an appropriate caching policy and content placement poses another significant issue that has been acknowledged in this research. Conventional caching policies that are experimented with at the edge do not consider the dynamic and stochastic characteristics of edge caching. As a result, we have proposed a cooperative deep reinforcement learning algorithm that deals with the dynamic nature of content demand. It also ensures efficient use of storage through the cooperation between nodes. In addition, previous works on cooperative caching have assumed the users to be static and didn't consider the mobile nature of users. Therefore, we have proposed UAVs as aerial Base Stations (UAV-BS) to assist in peak hours where a ground base station is insufficient to support the surge in user requests. In this novel research, we have demonstrated the cooperation between aerial and Ground Base Stations (GBS) and aimed at maximizing the global cache hit ratio. Simulations have shown that our proposed Cooperative Multi-Agent Actor-Critic algorithm outperforms conventional and reinforcement learning based caching methods and achieves a state-of-the-art global cache hit ratio when there is a surge in user requests. Thus, it opens the door for further research on cooperative caching in joint air and ground architecture.
引用
收藏
页码:2322 / 2337
页数:16
相关论文
共 50 条
  • [41] An extension of Genetic Network Programming with Reinforcement Learning using actor-critic
    Hatakeyama, Hiroyuki
    Mabu, Shingo
    Hirasawa, Kotaro
    Hu, Jinglu
    [J]. 2006 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-6, 2006, : 1522 - +
  • [42] Multi-Agent Reinforcement Learning Based Cooperative Content Caching for Mobile Edge Networks
    Jiang, Wei
    Feng, Gang
    Qin, Shuang
    Liu, Yijing
    [J]. IEEE ACCESS, 2019, 7 : 61856 - 61867
  • [43] Deep Multi-Agent Reinforcement Learning Based Cooperative Edge Caching in Wireless Networks
    Zhong, Chen
    Gursoy, M. Cenk
    Velipasalar, Senem
    [J]. ICC 2019 - 2019 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2019,
  • [44] AHAC: Actor Hierarchical Attention Critic for Multi-Agent Reinforcement Learning
    Wang, Yajie
    Shi, Dianxi
    Xue, Chao
    Jiang, Hao
    Wang, Gongju
    Gong, Peng
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2020, : 3013 - 3020
  • [45] Multi-agent Gradient-Based Off-Policy Actor-Critic Algorithm for Distributed Reinforcement Learning
    Ren, Jineng
    [J]. INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2024, 17 (01)
  • [46] Multi-UAV Cooperative Air Combat Decision-Making Based on Multi-Agent Double-Soft Actor-Critic
    Li, Shaowei
    Wang, Yongchao
    Zhou, Yaoming
    Jia, Yuhong
    Shi, Hanyue
    Yang, Fan
    Zhang, Chaoyue
    [J]. AEROSPACE, 2023, 10 (07)
  • [47] A New Advantage Actor-Critic Algorithm For Multi-Agent Environments
    Paczolay, Gabor
    Harmati, Istvan
    [J]. 2020 23RD IEEE INTERNATIONAL SYMPOSIUM ON MEASUREMENT AND CONTROL IN ROBOTICS (ISMCR), 2020,
  • [48] Dynamic Content Caching Based on Actor-Critic Reinforcement Learning for IoT Systems
    Lai, Lifeng
    Zheng, Fu-Chun
    Wen, Wanli
    Luo, Jingjing
    Li, Ge
    [J]. 2022 IEEE 96TH VEHICULAR TECHNOLOGY CONFERENCE (VTC2022-FALL), 2022,
  • [49] Improving sample efficiency in Multi-Agent Actor-Critic methods
    Ye, Zhenhui
    Chen, Yining
    Jiang, Xiaohong
    Song, Guanghua
    Yang, Bowei
    Fan, Sheng
    [J]. APPLIED INTELLIGENCE, 2022, 52 (04) : 3691 - 3704
  • [50] Multi-agent actor-critic with time dynamical opponent model
    Tian, Yuan
    Kladny, Klaus -Rudolf
    Wang, Qin
    Huang, Zhiwu
    Fink, Olga
    [J]. NEUROCOMPUTING, 2023, 517 : 165 - 172