Short video preloading via domain knowledge assisted deep reinforcement learning

被引:0
|
作者
Yuhong Xie [1 ]
Yuan Zhang [2 ]
Tao Lin [2 ]
Zipeng Pan [1 ]
SiZe Qian [1 ]
Bo Jiang [3 ]
Jinyao Yan [2 ]
机构
[1] School of Information and Communication Engineering, Communication University of China
[2] State Key Laboratory of Media Convergence and Communication, Communication University of China
[3] School of Electronic Information and Electrical Engineering, Shanghai Jiao Tong
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Short video applications like Tik Tok have seen significant growth in recent years. One common behavior of users on these platforms is watching and swiping through videos, which can lead to a significant waste of bandwidth. As such, an important challenge in short video streaming is to design a preloading algorithm that can effectively decide which videos to download, at what bitrate, and when to pause the download in order to reduce bandwidth waste while improving the Quality of Experience(QoE). However, designing such an algorithm is non-trivial, especially when considering the conflicting objectives of minimizing bandwidth waste and maximizing QoE. In this paper, we propose an end-to-end Deep reinforcement learning framework with Action Masking called DAM that leverages domain knowledge to learn an optimal policy for short video preloading. To achieve this, we introduce a reward shaping technique to minimize bandwidth waste and use action masking to make actions more reasonable, reduce playback rebuffering, and accelerate the training process. We have conducted extensive experiments using real-world video datasets and network traces including 4G/Wi Fi/5G. Our results show that DAM improves the Qo E score by 3.73%-11.28% compared to state-of-the-art algorithms, and achieves an average bandwidth waste of only 10.27%-12.07%, outperforming all baseline methods.
引用
下载
收藏
页码:1826 / 1836
页数:11
相关论文
共 50 条
  • [21] Bayesian Deep Reinforcement Learning via Deep Kernel Learning
    Xuan, Junyu
    Lu, Jie
    Yan, Zheng
    Zhang, Guangquan
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2019, 12 (01) : 164 - 171
  • [22] Adaptive Video Streaming via Deep Reinforcement Learning from User Trajectory Preferences
    Xiao, Qingyu
    Ye, Jin
    Pang, Chengjie
    Ma, Liangdi
    Jiang, Wengchao
    2020 IEEE 39TH INTERNATIONAL PERFORMANCE COMPUTING AND COMMUNICATIONS CONFERENCE (IPCCC), 2020,
  • [23] User preference-aware video highlight detection via deep reinforcement learning
    Wang, Han
    Wang, Kexin
    Wu, Yuqing
    Wang, Zhongzhi
    Zou, Ling
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (21-22) : 15015 - 15024
  • [24] Filtration network: A frame sampling strategy via deep reinforcement learning for video captioning
    Qian, Tiancheng
    Mei, Xue
    Xu, Pengxiang
    Ge, Kangqi
    Qiu, Zhelei
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 40 (06) : 11085 - 11097
  • [25] User preference-aware video highlight detection via deep reinforcement learning
    Han Wang
    Kexin Wang
    Yuqing Wu
    Zhongzhi Wang
    Ling Zou
    Multimedia Tools and Applications, 2020, 79 : 15015 - 15024
  • [26] Fast and Reliable Offloading via Deep Reinforcement Learning for Mobile Edge Video Computing
    Park, Soohyun
    Kang, Yeongeun
    Tian, Yafei
    Kim, Joongheon
    2020 34TH INTERNATIONAL CONFERENCE ON INFORMATION NETWORKING (ICOIN 2020), 2020, : 10 - 12
  • [27] Unsupervised Video Summarization via Deep Reinforcement Learning With Shot-Level Semantics
    Yuan, Ye
    Zhang, Jiawan
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (01) : 445 - 456
  • [28] Video Captioning via Hierarchical Reinforcement Learning
    Wang, Xin
    Chen, Wenhu
    Wu, Jiawei
    Wang, Yuan-Fang
    Wang, William Yang
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 4213 - 4222
  • [29] Improving Deep Reinforcement Learning with Knowledge Transfer
    Glatt, Ruben
    Reali Costa, Anna Helena
    THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 5036 - 5037
  • [30] Towards Knowledge Transfer in Deep Reinforcement Learning
    Glatt, Ruben
    da Silva, Felipe Leno
    Reali Costa, Anna Helena
    PROCEEDINGS OF 2016 5TH BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS 2016), 2016, : 91 - 96