DAM: Deep Reinforcement Learning based Preload Algorithm with Action Masking for Short Video Streaming

被引:8
|
作者
Qian, Si-Ze [1 ]
Xie, Yuhong [1 ]
Pan, Zipeng [1 ]
Zhang, Yuan [2 ]
Lin, Tao [2 ]
机构
[1] Commun Univ China, Beijing, Peoples R China
[2] Commun Univ China, State Key Lab Media Convergence & Commun, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Short video streaming; reinforcement learning; action masking;
D O I
10.1145/3503161.3551573
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Short video streaming has been increasingly popular in recent years. Due to its unique user behavior of watching and sliding, a critical technique issue is to design a preload algorithm deciding which video chunk to download next, bitrate selection and the pause time, in order to improve user experience while reducing bandwidth wastage. However, designing such a preload algorithm is non-trivial, especially taking into account conflicting goals of improving QoE and reducing bandwidth wastage. In this paper, we propose a deep reinforcement learning-based approach to simultaneously decide the aforementioned three decision variables via learning an optimal policy under a complex environment of varying network conditions and unpredictable user behavior. In particular, we incorporate domain knowledge into the decision procedure via action masking to make decisions more transparent, and accelerate the model training. Experimental results validate the proposed approach significantly outperforms baseline algorithms in terms of QoE metrics and bandwidth wastage.
引用
收藏
页码:7030 / 7034
页数:5
相关论文
共 50 条
  • [41] Perceptual Quality Aware Adaptive 360-Degree Video Streaming with Deep Reinforcement Learning
    Feng, Qingxuan
    Yang, Peng
    Lyu, Feng
    Yu, Li
    IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2022), 2022, : 1190 - 1195
  • [42] Deep Curriculum Reinforcement Learning for Adaptive 360 Video Streaming With Two-Stage Training
    Xie, Yuhong
    Zhang, Yuan
    Lin, Tao
    IEEE TRANSACTIONS ON BROADCASTING, 2024, 70 (02) : 441 - 452
  • [43] Toward Optimal Real-Time Volumetric Video Streaming: A Rolling Optimization and Deep Reinforcement Learning Based Approach
    Li, Jie
    Wang, Huiyu
    Liu, Zhi
    Zhou, Pengyuan
    Chen, Xianfu
    Li, Qiyue
    Hong, Richang
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (12) : 7870 - 7883
  • [44] Deep reinforcement learning based QoE-aware actor-learner architectures for video streaming in IoT environments
    Naresh, Mandan
    Das, Vikramjeet
    Saxena, Paresh
    Gupta, Manik
    COMPUTING, 2022, 104 (07) : 1527 - 1550
  • [45] Deep reinforcement learning based QoE-aware actor-learner architectures for video streaming in IoT environments
    Mandan Naresh
    Vikramjeet Das
    Paresh Saxena
    Manik Gupta
    Computing, 2022, 104 : 1527 - 1550
  • [46] Curriculum goal masking for continuous deep reinforcement learning
    Eppe, Manfred
    Magg, Sven
    Wermter, Stefan
    2019 JOINT IEEE 9TH INTERNATIONAL CONFERENCE ON DEVELOPMENT AND LEARNING AND EPIGENETIC ROBOTICS (ICDL-EPIROB), 2019, : 183 - 188
  • [47] DEEP REINFORCEMENT LEARNING FOR VIDEO PREDICTION
    Ho, Yung-Han
    Cho, Chuan-Yuan
    Peng, Wen-Hsiao
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 604 - 608
  • [48] Filter Pruning Algorithm Based on Deep Reinforcement Learning
    Liu Y.
    Teng Y.
    Niu T.
    Zhi J.
    Beijing Youdian Daxue Xuebao/Journal of Beijing University of Posts and Telecommunications, 2023, 46 (03): : 31 - 36
  • [49] BeiDou Short-Message Satellite Resource Allocation Algorithm Based on Deep Reinforcement Learning
    Xia, Kaiwen
    Feng, Jing
    Yan, Chao
    Duan, Chaofan
    ENTROPY, 2021, 23 (08)
  • [50] Unsupervised Video Summarization Based on Deep Reinforcement Learning with Interpolation
    Yoon, Ui Nyoung
    Hong, Myung Duk
    Jo, Geun-Sik
    SENSORS, 2023, 23 (07)