DAM: Deep Reinforcement Learning based Preload Algorithm with Action Masking for Short Video Streaming

被引:8
|
作者
Qian, Si-Ze [1 ]
Xie, Yuhong [1 ]
Pan, Zipeng [1 ]
Zhang, Yuan [2 ]
Lin, Tao [2 ]
机构
[1] Commun Univ China, Beijing, Peoples R China
[2] Commun Univ China, State Key Lab Media Convergence & Commun, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Short video streaming; reinforcement learning; action masking;
D O I
10.1145/3503161.3551573
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Short video streaming has been increasingly popular in recent years. Due to its unique user behavior of watching and sliding, a critical technique issue is to design a preload algorithm deciding which video chunk to download next, bitrate selection and the pause time, in order to improve user experience while reducing bandwidth wastage. However, designing such a preload algorithm is non-trivial, especially taking into account conflicting goals of improving QoE and reducing bandwidth wastage. In this paper, we propose a deep reinforcement learning-based approach to simultaneously decide the aforementioned three decision variables via learning an optimal policy under a complex environment of varying network conditions and unpredictable user behavior. In particular, we incorporate domain knowledge into the decision procedure via action masking to make decisions more transparent, and accelerate the model training. Experimental results validate the proposed approach significantly outperforms baseline algorithms in terms of QoE metrics and bandwidth wastage.
引用
收藏
页码:7030 / 7034
页数:5
相关论文
共 50 条
  • [21] Latency Aware Adaptive Video Streaming using Ensemble Deep Reinforcement Learning
    Zhao, Yin
    Shen, Qi-Wei
    Li, Wei
    Xu, Tong
    Niu, Wei-Hua
    Xu, Si-Ran
    PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 2647 - 2651
  • [22] HotDASH: Hotspot Aware Adaptive Video Streaming using Deep Reinforcement Learning
    Sengupta, Satadal
    Ganguly, Niloy
    Chakraborty, Sandip
    De, Pradipta
    2018 IEEE 26TH INTERNATIONAL CONFERENCE ON NETWORK PROTOCOLS (ICNP), 2018, : 165 - 175
  • [23] DEEP REINFORCEMENT LEARNING-BASED RATE ADAPTATION FOR ADAPTIVE 360-DEGREE VIDEO STREAMING
    Kan, Nuowen
    Zou, Junni
    Tang, Kexin
    Li, Chenglin
    Liu, Ning
    Xiong, Hongkai
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 4030 - 4034
  • [24] Deep-Reinforcement-Learning-based User-Preference-Aware Rate Adaptation for Video Streaming
    Lu, Lingyun
    Xiao, Jun
    Ni, Wei
    Du, Haifeng
    Zhang, Dalin
    2022 IEEE 23RD INTERNATIONAL SYMPOSIUM ON A WORLD OF WIRELESS, MOBILE AND MULTIMEDIA NETWORKS (WOWMOM 2022), 2022, : 416 - 424
  • [25] Reinforcement learning-based rate adaptation in dynamic video streaming
    Hafez, N. A.
    Hassan, M. S.
    Landolsi, T.
    TELECOMMUNICATION SYSTEMS, 2023, 83 (04) : 395 - 407
  • [26] Reinforcement learning-based rate adaptation in dynamic video streaming
    N. A. Hafez
    M. S. Hassan
    T. Landolsi
    Telecommunication Systems, 2023, 83 : 395 - 407
  • [27] Video Emotional Classification Based on Deep Reinforcement Learning
    Yuan, Tingting
    Yuan, Yuyu
    2023 3RD ASIA-PACIFIC CONFERENCE ON COMMUNICATIONS TECHNOLOGY AND COMPUTER SCIENCE, ACCTCS, 2023, : 168 - 171
  • [28] Adaptive Video Streaming via Deep Reinforcement Learning from User Trajectory Preferences
    Xiao, Qingyu
    Ye, Jin
    Pang, Chengjie
    Ma, Liangdi
    Jiang, Wengchao
    2020 IEEE 39TH INTERNATIONAL PERFORMANCE COMPUTING AND COMMUNICATIONS CONFERENCE (IPCCC), 2020,
  • [29] PAAS: A Preference-Aware Deep Reinforcement Learning Approach for 360° Video Streaming
    Wu, Chenglei
    Wang, Zhi
    Sun, Lifeng
    PROCEEDINGS OF THE 31ST ACM WORKSHOP ON NETWORK AND OPERATING SYSTEMS SUPPORT FOR DIGITAL AUDIO AND VIDEO (NOSSDAV '21), 2021, : 35 - 41
  • [30] Constrained Deep Reinforcement Learning for Low-Latency Wireless VR Video Streaming
    Li, Shaoang
    She, Changyang
    Li, Yonghui
    Vucetic, Branka
    2021 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2021,