Short video preloading via domain knowledge assisted deep reinforcement learning

Authors
Yuhong Xie [1]
Yuan Zhang [2]
Tao Lin [2]
Zipeng Pan [1]
Si-Ze Qian [1]
Bo Jiang [3]
Jinyao Yan [2]
Affiliations
[1] School of Information and Communication Engineering, Communication University of China
[2] State Key Laboratory of Media Convergence and Communication, Communication University of China
[3] School of Electronic Information and Electrical Engineering, Shanghai Jiao Tong
Abstract
Short video applications like TikTok have seen significant growth in recent years. One common user behavior on these platforms is watching and swiping through videos, which can lead to significant bandwidth waste. As such, an important challenge in short video streaming is to design a preloading algorithm that can effectively decide which videos to download, at what bitrate, and when to pause the download, in order to reduce bandwidth waste while improving the Quality of Experience (QoE). However, designing such an algorithm is non-trivial, especially given the conflicting objectives of minimizing bandwidth waste and maximizing QoE. In this paper, we propose DAM, an end-to-end Deep reinforcement learning framework with Action Masking that leverages domain knowledge to learn an optimal policy for short video preloading. To achieve this, we introduce a reward shaping technique to minimize bandwidth waste, and we use action masking to make actions more reasonable, reduce playback rebuffering, and accelerate training. We have conducted extensive experiments using real-world video datasets and network traces covering 4G, Wi-Fi, and 5G. Our results show that DAM improves the QoE score by 3.73% to 11.28% compared to state-of-the-art algorithms and achieves an average bandwidth waste of only 10.27% to 12.07%, outperforming all baseline methods.
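The two mechanisms named in the abstract, action masking and reward shaping, can be illustrated with a minimal Python sketch. This is not the DAM implementation: the action layout (index 0 pauses the download, the remaining indices select a bitrate), the buffer thresholds, the function names `valid_actions`, `masked_softmax`, and `shaped_reward`, and the waste weight are all hypothetical choices made only for illustration.

```python
import math


def valid_actions(buffer_s, min_buffer_s=1.0, max_buffer_s=8.0, n_bitrates=3):
    """Hypothetical domain rule producing a boolean mask over actions.

    Action 0 pauses the download; actions 1..n_bitrates download at a bitrate.
    Pausing is disallowed when the buffer is nearly empty (rebuffering risk),
    and downloading is disallowed when the buffer is full (likely waste).
    """
    pause_ok = buffer_s >= min_buffer_s
    download_ok = buffer_s < max_buffer_s
    return [pause_ok] + [download_ok] * n_bitrates


def masked_softmax(logits, mask):
    """Softmax over policy logits that assigns probability 0 to masked actions."""
    if not any(mask):
        raise ValueError("at least one action must be valid")
    masked = [l if ok else -math.inf for l, ok in zip(logits, mask)]
    peak = max(masked)  # subtract the maximum for numerical stability
    exps = [math.exp(l - peak) for l in masked]  # exp(-inf) evaluates to 0.0
    total = sum(exps)
    return [e / total for e in exps]


def shaped_reward(qoe, wasted_mb, waste_weight=0.5):
    """Hypothetical shaped reward: QoE minus a penalty on wasted megabytes."""
    return qoe - waste_weight * wasted_mb


if __name__ == "__main__":
    mask = valid_actions(buffer_s=0.2)  # nearly empty buffer: pause is masked
    probs = masked_softmax([2.0, 0.5, 1.0, 1.5], mask)
    print(mask, [round(p, 3) for p in probs])
    print(shaped_reward(qoe=10.0, wasted_mb=4.0))
```

Because masked actions receive zero probability before sampling, the agent never spends exploration on choices that domain knowledge already rules out, which is one plausible reading of why masking would speed up training.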
Pages: 1826-1836
Page count: 11
Source: Digital Communications and Networks, 2024, 10 (06): 1826-1836