Joint Caching and Computing Service Placement for Edge-Enabled IoT Based on Deep Reinforcement Learning

被引:15
|
作者
Chen, Yan [1 ]
Sun, Yanjing [1 ]
Yang, Bin [2 ]
Taleb, Tarik [3 ,4 ]
机构
[1] China Univ Min & Technol, Sch Informat & Control Engn, Xuzhou 221116, Jiangsu, Peoples R China
[2] Chuzhou Univ, Sch Comp & Informat Engn, Chuzhou 239000, Anhui, Peoples R China
[3] Univ Oulu, Ctr Wireless Commun, Oulu 90570, Finland
[4] Sejong Univ, Dept Comp & Informat Secur, Seoul 05006, South Korea
基金
芬兰科学院; 中国国家自然科学基金;
关键词
Deep reinforcement learning (DRL); edge computing; IoT; joint service placement; sensing-data-driven applications;
D O I
10.1109/JIOT.2022.3168869
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
By placing edge service functions in proximity to IoT facilities, edge computing can satisfy various IoT applications' resource and latency requirements. Sensing-data-driven IoT applications are prevalent in IoT systems, and their task processing relies on sensing data from sensors. Therefore, to ensure the Quality of Service (QoS) of such applications in an edge-enabled IoT system, dedicated caching functions (CFs) are required to cache necessary sensing data. This article considers an edge-enabled IoT system and investigates the joint caching and computing service placement (JCCSP) problem for sensing-data-driven IoT applications. Then, deep reinforcement learning (DRL) is exploited to address the problem since it can adapt to a heterogeneous system with limited prior knowledge. In the proposed DRL-based approaches, a policy network based on the encoder-decoder model is constructed to address the issue of varying sizes of JCCSP states and actions caused by different numbers of CFs related to applications. Then, an on-policy REINFORCE-based method is adopted to train the policy network. After that an off-policy training method based on the twin-delayed (TD) deep deterministic policy gradient (DDPG) is proposed to enhance the training efficiency and experience utilization. In the proposed DDPG-based method, a weight-averaged twin-Q-delayed (WATQD) algorithm is introduced to reduce the bias of Q-value estimation. Simulation results show that our proposed DRL-based JCCSP approaches can achieve converged performance that is significantly superior to benchmarks. Moreover, compared with the original TD method, the proposed WATQD method can significantly improve the training stability.
引用
收藏
页码:19501 / 19514
页数:14
相关论文
共 50 条
  • [1] QoE-Driven Content-Centric Caching With Deep Reinforcement Learning in Edge-Enabled IoT
    He, Xiaoming
    Wang, Kun
    Xu, Wenyao
    [J]. IEEE COMPUTATIONAL INTELLIGENCE MAGAZINE, 2019, 14 (04) : 12 - 20
  • [2] DRLD-SP: A Deep-Reinforcement-Learning-Based Dynamic Service Placement in Edge-Enabled Internet of Vehicles
    Talpur, Anum
    Gurusamy, Mohan
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (08): : 6239 - 6251
  • [3] Deep Reinforcement Learning-based Joint Caching and Computing Edge Service Placement for Sensing-Data-Driven IIoT Applications
    Chen, Yan
    Sun, Yanjing
    Yang, Bin
    Taleb, Tarik
    [J]. IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2022), 2022, : 4287 - 4292
  • [4] Joint Service Caching and Computation Offloading Scheme Based on Deep Reinforcement Learning in Vehicular Edge Computing Systems
    Xue, Zheng
    Liu, Chang
    Liao, Canliang
    Han, Guojun
    Sheng, Zhengguo
    [J]. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (05) : 6709 - 6722
  • [5] Remote Attestation as a Service for Edge-Enabled IoT
    Calvo, Miguel
    Beltran, Marta
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON SERVICES COMPUTING (SCC 2021), 2021, : 329 - 339
  • [6] Deep-Reinforcement-Learning-Based Service Placement for Video Analysis in Edge Computing
    Zhu, Qijun
    Wang, Sichen
    Huang, Hualong
    Lei, Yuchuan
    Zhan, Wenhan
    Duan, Hancong
    [J]. 2023 8TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA ANALYTICS, ICCCBDA, 2023, : 355 - 359
  • [7] JCSP: Joint Caching and Service Placement for Edge Computing Systems
    Gao, Yicheng
    Casale, Giuliano
    [J]. 2022 IEEE/ACM 30TH INTERNATIONAL SYMPOSIUM ON QUALITY OF SERVICE (IWQOS), 2022,
  • [8] Deep Reinforcement Learning-based computation offloading and distributed edge service caching for Mobile Edge Computing
    Xie, Mande
    Ye, Jiefeng
    Zhang, Guoping
    Ni, Xueping
    [J]. COMPUTER NETWORKS, 2024, 250
  • [9] Hierarchical Deep Reinforcement Learning for Joint Service Caching and Computation Offloading in Mobile Edge-Cloud Computing
    Sun, Chuan
    Li, Xiuhua
    Wang, Chenyang
    He, Qiang
    Wang, Xiaofei
    Leung, Victor C. M.
    [J]. IEEE TRANSACTIONS ON SERVICES COMPUTING, 2024, 17 (04) : 1548 - 1564
  • [10] HFDRL: An Intelligent Dynamic Cooperate Cashing Method Based on Hierarchical Federated Deep Reinforcement Learning in Edge-Enabled IoT
    Majidi, Fariba
    Khayyambashi, Mohammad Reza
    Barekatain, Behrang
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (02) : 1402 - 1413