Proactive Content Caching Based on Actor-Critic Reinforcement Learning for Mobile Edge Networks

被引:16
|
作者
Jiang, Wei [1 ]
Feng, Daquan [1 ]
Sun, Yao [2 ]
Feng, Gang [3 ,4 ]
Wang, Zhenzhong [5 ]
Xia, Xiang-Gen [6 ]
机构
[1] Shenzhen Univ, Guangdong Prov Engn Lab Digital Creat Technol, Shenzhen Key Lab Digital Creat Technol, Coll Elect & Informat Engn,Guangdong Key Lab Inte, Shenzhen 518060, Peoples R China
[2] Univ Glasgow, James Watt Sch Engn, Glasgow G12 8QQ, Lanark, Scotland
[3] Univ Elect Sci & Technol China, Yangtze Delta Reg Inst Huzhou, Huzhou 313001, Scotland
[4] Univ Elect Sci & Technol China, Natl Key Lab Sci & Technol Commun, Chengdu 611731, Peoples R China
[5] Tech Management Ctr, China Media Grp, Beijing 100020, Peoples R China
[6] Univ Delaware, Dept Elect & Comp Engn, Newark, DE 19716 USA
基金
国家重点研发计划;
关键词
Actor-critic algorithm; branching neural network; reinforcement learning; mobile edge caching; 5G NETWORKS; SMALL-CELL; DELIVERY; POLICY;
D O I
10.1109/TCCN.2021.3130995
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
Mobile edge caching/computing (MEC) has emerged as a promising approach for addressing the drastic increasing mobile data traffic by bringing high caching and computing capabilities to the edge of networks. Under MEC architecture, content providers (CPs) are allowed to lease some virtual machines (VMs) at MEC servers to proactively cache popular contents for improving users' quality of experience. The scalable cache resource model rises the challenge for determining the ideal number of leased VMs for CPs to obtain the minimum expected downloading delay of users at the lowest caching cost. To address these challenges, in this paper, we propose an actor-critic (AC) reinforcement learning based proactive caching policy for mobile edge networks without the prior knowledge of users' content demand. Specifically, we formulate the proactive caching problem under dynamical users' content demand as a Markov decision process and propose a AC based caching algorithm to minimize the caching cost and the expected downloading delay. Particularly, to reduce the computational complexity, a branching neural network is employed to approximate the policy function in the actor part. Numerical results show that the proposed caching algorithm can significantly reduce the total cost and the average downloading delay when compared with other popular algorithms.
引用
收藏
页码:1239 / 1252
页数:14
相关论文
共 50 条
  • [41] Forward Actor-Critic for Nonlinear Function Approximation in Reinforcement Learning
    Veeriah, Vivek
    van Seijen, Harm
    Sutton, Richard S.
    AAMAS'17: PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2017, : 556 - 564
  • [42] Maximizing Information Usefulness in Vehicular CP Networks Using Actor-Critic Reinforcement Learning
    Ghnaya, Imed
    Ahmed, Toufik
    Mosbah, Mohamed
    Aniss, Hasnaa
    2022 18TH INTERNATIONAL CONFERENCE ON NETWORK AND SERVICE MANAGEMENT (CNSM 2022): INTELLIGENT MANAGEMENT OF DISRUPTIVE NETWORK TECHNOLOGIES AND SERVICES, 2022, : 296 - 302
  • [43] Proactive content caching by exploiting transfer learning for mobile edge computing
    Hou, Tingting
    Feng, Gang
    Qin, Shuang
    Jiang, Wei
    INTERNATIONAL JOURNAL OF COMMUNICATION SYSTEMS, 2018, 31 (11)
  • [44] Proactive Content Caching by Exploiting Transfer Learning for Mobile Edge Computing
    Hou, Tingting
    Feng, Gang
    Qin, Shuang
    Jiang, Wei
    GLOBECOM 2017 - 2017 IEEE GLOBAL COMMUNICATIONS CONFERENCE, 2017,
  • [45] THE APPLICATION OF ACTOR-CRITIC REINFORCEMENT LEARNING FOR FAB DISPATCHING SCHEDULING
    Kim, Namyong
    Shin, IIayong
    2017 WINTER SIMULATION CONFERENCE (WSC), 2017, : 4570 - 4571
  • [46] ACTOR-CRITIC DEEP REINFORCEMENT LEARNING FOR DYNAMIC MULTICHANNEL ACCESS
    Zhong, Chen
    Lu, Ziyang
    Gursoy, M. Cenk
    Velipasalar, Senem
    2018 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP 2018), 2018, : 599 - 603
  • [47] Asynchronous Actor-Critic for Multi-Agent Reinforcement Learning
    Xiao, Yuchen
    Tan, Weihao
    Amato, Christopher
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [48] Actor-Critic Deep Reinforcement Learning for Energy Minimization in UAV-Aided Networks
    Yuan, Yaxiong
    Lei, Lei
    Vu, Thang X.
    Chatzinotas, Symeon
    Ottersten, Bjorn
    2020 EUROPEAN CONFERENCE ON NETWORKS AND COMMUNICATIONS (EUCNC 2020), 2020, : 348 - 352
  • [49] An Actor-Critic Hierarchical Reinforcement Learning Model for Course Recommendation
    Liang, Kun
    Zhang, Guoqiang
    Guo, Jinhui
    Li, Wentao
    ELECTRONICS, 2023, 12 (24)
  • [50] Network Congestion Control Algorithm Based on Actor-Critic Reinforcement Learning Model
    Xu, Tao
    Gong, Lina
    Zhang, Wei
    Li, Xuhong
    Wang, Xia
    Pan, Wenwen
    ADVANCES IN MATERIALS, MACHINERY, ELECTRONICS II, 2018, 1955