Proactive Content Caching Based on Actor-Critic Reinforcement Learning for Mobile Edge Networks

被引:11
|
作者
Jiang, Wei [1 ]
Feng, Daquan [1 ]
Sun, Yao [2 ]
Feng, Gang [3 ,4 ]
Wang, Zhenzhong [5 ]
Xia, Xiang-Gen [6 ]
机构
[1] Shenzhen Univ, Guangdong Prov Engn Lab Digital Creat Technol, Shenzhen Key Lab Digital Creat Technol, Coll Elect & Informat Engn,Guangdong Key Lab Inte, Shenzhen 518060, Peoples R China
[2] Univ Glasgow, James Watt Sch Engn, Glasgow G12 8QQ, Lanark, Scotland
[3] Univ Elect Sci & Technol China, Yangtze Delta Reg Inst Huzhou, Huzhou 313001, Scotland
[4] Univ Elect Sci & Technol China, Natl Key Lab Sci & Technol Commun, Chengdu 611731, Peoples R China
[5] Tech Management Ctr, China Media Grp, Beijing 100020, Peoples R China
[6] Univ Delaware, Dept Elect & Comp Engn, Newark, DE 19716 USA
基金
国家重点研发计划;
关键词
Actor-critic algorithm; branching neural network; reinforcement learning; mobile edge caching; 5G NETWORKS; SMALL-CELL; DELIVERY; POLICY;
D O I
10.1109/TCCN.2021.3130995
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
Mobile edge caching/computing (MEC) has emerged as a promising approach for addressing the drastic increasing mobile data traffic by bringing high caching and computing capabilities to the edge of networks. Under MEC architecture, content providers (CPs) are allowed to lease some virtual machines (VMs) at MEC servers to proactively cache popular contents for improving users' quality of experience. The scalable cache resource model rises the challenge for determining the ideal number of leased VMs for CPs to obtain the minimum expected downloading delay of users at the lowest caching cost. To address these challenges, in this paper, we propose an actor-critic (AC) reinforcement learning based proactive caching policy for mobile edge networks without the prior knowledge of users' content demand. Specifically, we formulate the proactive caching problem under dynamical users' content demand as a Markov decision process and propose a AC based caching algorithm to minimize the caching cost and the expected downloading delay. Particularly, to reduce the computational complexity, a branching neural network is employed to approximate the policy function in the actor part. Numerical results show that the proposed caching algorithm can significantly reduce the total cost and the average downloading delay when compared with other popular algorithms.
引用
收藏
页码:1239 / 1252
页数:14
相关论文
共 50 条
  • [1] Dynamic Content Caching Based on Actor-Critic Reinforcement Learning for IoT Systems
    Lai, Lifeng
    Zheng, Fu-Chun
    Wen, Wanli
    Luo, Jingjing
    Li, Ge
    [J]. 2022 IEEE 96TH VEHICULAR TECHNOLOGY CONFERENCE (VTC2022-FALL), 2022,
  • [2] An actor-critic reinforcement learning-based resource management in mobile edge computing systems
    Fu, Fang
    Zhang, Zhicai
    Yu, Fei Richard
    Yan, Qiao
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2020, 11 (08) : 1875 - 1889
  • [3] An actor-critic reinforcement learning-based resource management in mobile edge computing systems
    Fang Fu
    Zhicai Zhang
    Fei Richard Yu
    Qiao Yan
    [J]. International Journal of Machine Learning and Cybernetics, 2020, 11 : 1875 - 1889
  • [4] Heterogeneous Edge Caching Based on Actor-Critic Learning With Attention Mechanism Aiding
    Wang, Chenyang
    Li, Ruibin
    Wang, Xiaofei
    Taleb, Tarik
    Guo, Song
    Sun, Yuxia
    Leung, Victor C. M.
    [J]. IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2023, 10 (06): : 3409 - 3420
  • [5] Actor-Critic based Improper Reinforcement Learning
    Zaki, Mohammadi
    Mohan, Avinash
    Gopalan, Aditya
    Mannor, Shie
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [6] UAV Assisted Cooperative Caching on Network Edge Using Multi-Agent Actor-Critic Reinforcement Learning
    Araf, Sadman
    Saha, Adittya Soukarjya
    Kazi, Sadia Hamid
    Tran, Nguyen H. H.
    Alam, Md. Golam Rabiul
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (02) : 2322 - 2337
  • [7] An Actor-Critic Deep Reinforcement Learning Based Computation Offloading for Three-tier Mobile Computing Networks
    Liu, Yu
    Cui, Qimei
    Zhang, Jian
    Chen, Yu
    Hou, Yanzhao
    [J]. 2019 11TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP), 2019,
  • [8] A World Model for Actor-Critic in Reinforcement Learning
    Panov, A. I.
    Ugadiarov, L. A.
    [J]. PATTERN RECOGNITION AND IMAGE ANALYSIS, 2023, 33 (03) : 467 - 477
  • [9] Curious Hierarchical Actor-Critic Reinforcement Learning
    Roeder, Frank
    Eppe, Manfred
    Nguyen, Phuong D. H.
    Wermter, Stefan
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2020, PT II, 2020, 12397 : 408 - 419
  • [10] Integrated Actor-Critic for Deep Reinforcement Learning
    Zheng, Jiaohao
    Kurt, Mehmet Necip
    Wang, Xiaodong
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2021, PT IV, 2021, 12894 : 505 - 518