Short video preloading via domain knowledge assisted deep reinforcement learning

Cited by: 0

Authors:

Yuhong Xie [1]

Yuan Zhang [2]

Tao Lin [2]

Zipeng Pan [1]

SiZe Qian [1]

Bo Jiang [3]

Jinyao Yan [2]

Affiliations:

[1] School of Information and Communication Engineering, Communication University of China

[2] State Key Laboratory of Media Convergence and Communication, Communication University of China

[3] School of Electronic Information and Electrical Engineering, Shanghai Jiao Tong

Source:

Digital Communications and Networks | 2024 / Vol. 10 / No. 06

DOI: Not available

Abstract:

Short video applications like TikTok have seen significant growth in recent years. Users on these platforms commonly watch a video briefly and then swipe to the next one, so much of the downloaded data is never played and bandwidth is wasted. An important challenge in short video streaming is therefore to design a preloading algorithm that decides which videos to download, at what bitrate, and when to pause the download, in order to reduce bandwidth waste while improving the Quality of Experience (QoE). Designing such an algorithm is non-trivial, especially given the conflicting objectives of minimizing bandwidth waste and maximizing QoE. In this paper, we propose an end-to-end Deep reinforcement learning framework with Action Masking, called DAM, that leverages domain knowledge to learn an optimal policy for short video preloading. To achieve this, we introduce a reward shaping technique to minimize bandwidth waste, and we use action masking to make actions more reasonable, reduce playback rebuffering, and accelerate the training process. We have conducted extensive experiments using real-world video datasets and network traces covering 4G, Wi-Fi, and 5G. Our results show that DAM improves the QoE score by 3.73% to 11.28% compared to state-of-the-art algorithms and achieves an average bandwidth waste of only 10.27% to 12.07%, outperforming all baseline methods.
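The action masking idea mentioned in the abstract can be illustrated with a minimal sketch. This is not the DAM implementation: the action set, the `build_mask` rule, and the `max_buffer_s` threshold are all hypothetical and chosen only to show how domain knowledge can zero out the probability of unreasonable actions before the policy samples one.

```python
import math

# Hypothetical action set: pause, or download at one of three bitrates.
ACTIONS = ["pause", "low", "medium", "high"]


def masked_softmax(logits, mask):
    """Softmax over logits in which masked-out actions get probability 0."""
    if not any(mask):
        raise ValueError("at least one action must remain valid")
    # Subtract the largest valid logit for numerical stability.
    peak = max(l for l, ok in zip(logits, mask) if ok)
    exps = [math.exp(l - peak) if ok else 0.0 for l, ok in zip(logits, mask)]
    total = sum(exps)
    return [e / total for e in exps]


def build_mask(buffer_s, max_buffer_s=4.0):
    """Hypothetical domain rule (an assumption, not taken from the paper):
    never pause with an empty buffer, never download with a full one."""
    if buffer_s <= 0.0:
        return [False, True, True, True]
    if buffer_s >= max_buffer_s:
        return [True, False, False, False]
    return [True, True, True, True]


# Example policy outputs (made-up numbers) with an empty playback buffer.
logits = [2.0, 0.5, 0.1, -1.0]
probs = masked_softmax(logits, build_mask(buffer_s=0.0))
```

With an empty buffer, `probs[0]` (pause) is exactly zero and the remaining probability mass is renormalized over the download actions, so the agent never has to learn from experience that pausing at that point causes rebuffering.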

Pages: 1826-1836

Page count: 11
