Federated Deep Reinforcement Learning for Internet of Things With Decentralized Cooperative Edge Caching

被引:227
|
作者
Wang, Xiaofei [1 ]
Wang, Chenyang [1 ]
Li, Xiuhua [2 ,3 ]
Leung, Victor C. M. [4 ,5 ]
Taleb, Tarik [6 ,7 ,8 ]
机构
[1] Tianjin Univ, Coll Intelligence & Comp, Tianjin 300072, Peoples R China
[2] Chongqing Univ, Sch Big Data & Software Engn, State Key Lab Power Transmiss Equipment & Syst Se, Chongqing 401331, Peoples R China
[3] Chongqing Univ, Key Lab Dependable Serv Comp Cyber Phys Soc, Minist Educ, Chongqing 401331, Peoples R China
[4] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen 518060, Peoples R China
[5] Univ British Columbia, Dept Elect & Comp Engn, Vancouver, BC V6T 1Z4, Canada
[6] Aalto Univ, Sch Elect Engn, Dept Commun & Networking, Espoo 02150, Finland
[7] Oulu Univ, Informat Technol & Elect Engn, Oulu 90570, Finland
[8] Sejong Univ, Dept Comp & Informat Secur, Seoul 05006, South Korea
基金
芬兰科学院; 加拿大自然科学与工程研究理事会;
关键词
Internet of Things; Training; Delays; Machine learning; Simulation; Electronic mail; Wireless communication; Cooperative caching; deep reinforcement learning (DRL); edge caching; federated learning; hit rate; Internet of Things (IoT); MOBILE; CLOUD; PERFORMANCE; DELIVERY;
D O I
10.1109/JIOT.2020.2986803
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Edge caching is an emerging technology for addressing massive content access in mobile networks to support rapidly growing Internet-of-Things (IoT) services and applications. However, most current optimization-based methods lack a self-adaptive ability in dynamic environments. To tackle these challenges, current learning-based approaches are generally proposed in a centralized way. However, network resources may be overconsumed during the training and data transmission process. To address the complex and dynamic control issues, we propose a federated deep-reinforcement-learning-based cooperative edge caching (FADE) framework. FADE enables base stations (BSs) to cooperatively learn a shared predictive model by considering the first-round training parameters of the BSs as the initial input of the local training, and then uploads near-optimal local parameters to the BSs to participate in the next round of global training. Furthermore, we prove the expectation convergence of FADE. Trace-driven simulation results demonstrate the effectiveness of the proposed FADE framework on reducing the performance loss and average delay, offloading backhaul traffic, and improving the hit rate.
引用
收藏
页码:9441 / 9455
页数:15
相关论文
共 50 条
  • [21] Edge QoE: Computation Offloading With Deep Reinforcement Learning for Internet of Things
    Lu, Haodong
    He, Xiaoming
    Du, Miao
    Ruan, Xiukai
    Sun, Yanfei
    Wang, Kun
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2020, 7 (10): : 9255 - 9265
  • [22] Cooperative Edge Caching Based on Elastic Federated and Multi-Agent Deep Reinforcement Learning in Next-Generation Networks
    Wu, Qiong
    Wang, Wenhua
    Fan, Pingyi
    Fan, Qiang
    Zhu, Huiling
    Letaief, Khaled B.
    [J]. IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2024, 21 (04): : 4179 - 4196
  • [23] Federated Reinforcement Learning with Adaptive Training Times for Edge Caching
    Shaoshuai Fan
    Liyun Hu
    Hui Tian
    [J]. China Communications, 2022, (08) : 57 - 72
  • [24] Deep Reinforcement Learning-based Edge Caching and Multi-link Cooperative Communication in Internet-of-Vehicles
    Ma, Teng
    Chen, Xin
    Jiao, Libo
    Chen, Ying
    [J]. 2021 17TH INTERNATIONAL CONFERENCE ON MOBILITY, SENSING AND NETWORKING (MSN 2021), 2021, : 567 - 574
  • [25] Federated Reinforcement Learning with Adaptive Training Times for Edge Caching
    Fan, Shaoshuai
    Hu, Liyun
    Tian, Hui
    [J]. CHINA COMMUNICATIONS, 2022, 19 (08) : 57 - 72
  • [26] Dynamic Spectrum Access for Internet-of-Things Based on Federated Deep Reinforcement Learning
    Li, Feng
    Shen, Bowen
    Guo, Jiale
    Lam, Kwok-Yan
    Wei, Guiyi
    Wang, Li
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2022, 71 (07) : 7952 - 7956
  • [27] Deep Decentralized Reinforcement Learning for Cooperative Control
    Koepf, Florian
    Tesfazgi, Samuel
    Flad, Michael
    Hohmann, Soeren
    [J]. IFAC PAPERSONLINE, 2020, 53 (02): : 1555 - 1562
  • [28] POSTER: Decentralized Federated Learning for Internet of Things Anomaly Detection
    Lian, Zhuotao
    Su, Chunhua
    [J]. ASIA CCS'22: PROCEEDINGS OF THE 2022 ACM ASIA CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, 2022, : 1249 - 1251
  • [29] Dynamic spectrum access for Internet-of-Things with hierarchical federated deep reinforcement learning
    Zhang, Songbo
    Lam, Kwok-Yan
    Shen, Bowen
    Wang, Li
    Li, Feng
    [J]. AD HOC NETWORKS, 2023, 149
  • [30] RecCac: Recommendation-Empowered Cooperative Edge Caching for Internet of Things
    HAN Suning
    LI Xiuhua
    SUN Chuan
    WANG Xiaofei
    Victor C.M.LEUNG
    [J]. ZTE Communications, 2021, 19 (02) : 2 - 10