DAM: Deep Reinforcement Learning based Preload Algorithm with Action Masking for Short Video Streaming

Cited by: 8
Authors
Qian, Si-Ze [1]
Xie, Yuhong [1]
Pan, Zipeng [1]
Zhang, Yuan [2]
Lin, Tao [2]
Affiliations
[1] Communication University of China, Beijing, People's Republic of China
[2] Communication University of China, State Key Laboratory of Media Convergence and Communication, Beijing, People's Republic of China
Funding
National Natural Science Foundation of China;
Keywords
Short video streaming; reinforcement learning; action masking;
DOI
10.1145/3503161.3551573
Chinese Library Classification number
TP39 [Computer applications];
Discipline classification code
081203; 0835;
Abstract
Short video streaming has become increasingly popular in recent years. Because users both watch and swipe between videos, a critical technical issue is designing a preload algorithm that decides which video chunk to download next, at which bitrate, and how long to pause, so as to improve user experience while reducing bandwidth wastage. Designing such an algorithm is non-trivial, especially because improving quality of experience (QoE) and reducing bandwidth wastage are conflicting goals. In this paper, we propose a deep reinforcement learning-based approach that makes these three decisions simultaneously by learning an optimal policy in a complex environment of varying network conditions and unpredictable user behavior. In particular, we incorporate domain knowledge into the decision procedure via action masking, which makes decisions more transparent and accelerates model training. Experimental results show that the proposed approach significantly outperforms baseline algorithms in terms of QoE metrics and bandwidth wastage.
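The abstract does not spell out how action masking is applied, so the following is only a minimal sketch of the general technique: invalid actions are excluded before the policy's scores are normalized, so they receive zero probability. The masking rule in `build_video_mask` (skip videos that are fully downloaded or already buffered `max_ahead` chunks ahead), the function names, and all numbers are hypothetical illustrations, not the rules used in the paper.

```python
import math


def masked_softmax(logits, mask):
    """Turn policy logits into probabilities, giving masked actions probability 0."""
    if len(logits) != len(mask):
        raise ValueError("logits and mask must have the same length")
    if not any(mask):
        raise ValueError("at least one action must remain valid")
    # Subtract the largest valid logit for numerical stability.
    peak = max(l for l, ok in zip(logits, mask) if ok)
    exps = [math.exp(l - peak) if ok else 0.0 for l, ok in zip(logits, mask)]
    total = sum(exps)
    return [e / total for e in exps]


def build_video_mask(buffered, total, max_ahead):
    """Hypothetical domain rule (an assumption, not taken from the paper):
    a video is a valid download target only if it still has chunks left
    and fewer than max_ahead chunks are already buffered for it."""
    return [b < t and b < max_ahead for b, t in zip(buffered, total)]


# Illustrative values: policy scores for four candidate videos.
logits = [2.0, 1.0, 0.5, 3.0]
buffered = [1, 4, 0, 5]    # chunks already downloaded per video
total = [10, 4, 8, 12]     # total chunks per video

mask = build_video_mask(buffered, total, max_ahead=5)
probs = masked_softmax(logits, mask)
```

In this sketch the video with the highest raw score is masked out because its buffer is already at the limit, so the policy can only choose between the two remaining valid videos. The same pattern could be applied separately to the bitrate and pause-time decisions.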
Pages: 7030-7034
Number of pages: 5