Recommendation-Enabled Edge Caching and D2D Offloading via Incentive-Driven Deep Reinforcement Learning

Cited by: 0
Authors
Wu, Tong [1]
Yu, Dongjin [1]
Liu, Chengfei [2]
Wang, Dongjing [1]
Huang, Binbin [1]
Affiliations
[1] Hangzhou Dianzi Univ, Coll Comp Sci & Technol, Hangzhou 310018, Peoples R China
[2] Swinburne Univ Technol, Dept Comp Technol, Melbourne, Vic 3122, Australia
Funding
National Natural Science Foundation of China
Keywords
Device-to-device communication; Costs; Prediction algorithms; Predictive models; Reinforcement learning; Sparse matrices; Quality of experience; Device-to-Device; edge caching; incentive mechanism; recommendation; reinforcement learning;
DOI
10.1109/TSC.2024.3351219
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
This article proposes a novel architecture for Recommendation-Enabled Edge Caching and Device-to-Device (D2D) Offloading via Incentive-driven Deep Reinforcement Learning (DRL). The architecture not only addresses the inaccurate recommendations caused by a sparse rating matrix, but also encourages users to participate in D2D offloading through an effective incentive mechanism. Specifically, we define the Pseudo Markov Decision Process (PMDP) for the first time, which converts a non-sequential process (e.g., rating prediction) into a sequential one, making it suitable for DRL. Then, combining Supervised Learning (SL) and DRL, we propose a Supervised DRL for Collaborative Filtering (CF) algorithm, named SDRLCF, to predict missing ratings. Next, from the perspective of the Content Service Center (CSC), incentive-driven recommendation-enabled edge caching and D2D offloading is formulated as a Non-Linear Integer Programming (NLIP) problem, which is NP-hard, so an optimal solution is difficult to obtain in polynomial time. To address this issue, a DRL-based Edge Caching and Recommendation algorithm, named DRLECR, is proposed to minimize the cost of the CSC. Finally, drawing on economic theory, a Reverse Auction based Payment Determination algorithm under the Vickrey-Clarke-Groves (VCG) scheme, named RAPD, is proposed; it stimulates users to participate in edge caching and D2D offloading while guaranteeing the individual rationality and truthfulness of participants. Extensive experimental results on both realistic and synthetic datasets demonstrate that the proposed algorithms outperform baseline methods under different scenarios.
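The abstract does not describe how RAPD works internally, so the following is not the authors' algorithm. It is only a minimal sketch of the textbook idea behind a VCG-style reverse auction, under the assumption that the buyer needs exactly `k` helpers and each user submits one asking price. The function name `reverse_vcg_auction`, the user labels, and the bid values are all hypothetical. In this simplified setting the VCG payment to every winner equals the lowest rejected bid, which is what makes truthful bidding a safe strategy and keeps each winner's payment at or above their ask (individual rationality).

```python
from typing import Dict, List, Tuple


def reverse_vcg_auction(bids: Dict[str, float], k: int) -> Tuple[List[str], Dict[str, float]]:
    """Select the k cheapest bidders and pay each the (k+1)-th lowest bid.

    Illustrative only: a generic uniform-price reverse auction, not RAPD.
    """
    if not 0 < k < len(bids):
        raise ValueError("need at least k + 1 bidders to set a payment")
    # Rank users by asking price; break ties by name so the result is deterministic.
    ranked = sorted(bids, key=lambda user: (bids[user], user))
    winners = ranked[:k]
    # The lowest rejected bid is the critical price: a winner who asked for
    # more than this would have lost, so it is the VCG payment here.
    critical_price = bids[ranked[k]]
    payments = {user: critical_price for user in winners}
    return winners, payments


# Hypothetical asking prices from four users offering D2D help.
bids = {"u1": 4.0, "u2": 2.5, "u3": 6.0, "u4": 3.0}
winners, payments = reverse_vcg_auction(bids, k=2)
print(winners)   # ['u2', 'u4']
print(payments)  # {'u2': 4.0, 'u4': 4.0}
```

Because a winner's payment depends only on the other users' bids, overstating or understating one's own cost cannot raise the payment; it can only change whether the user wins.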
Pages: 1724-1738
Number of pages: 15