ABRaider: Multiphase Reinforcement Learning for Environment-Adaptive Video Streaming

被引:0
|
作者
Choi, Wangyu [1 ]
Chen, Jiasi [2 ]
Yoon, Jongwon [1 ]
机构
[1] Hanyang Univ, Dept Comp Sci & Engn, Ansan 15588, South Korea
[2] Univ Calif Riverside, Dept Comp Sci & Engn, Riverside, CA 92521 USA
基金
新加坡国家研究基金会;
关键词
Quality of experience; Streaming media; Prediction algorithms; Bandwidth; Bit rate; Machine learning algorithms; Heuristic algorithms; Adaptive bitrate algorithm; federated learning; quality of experience; reinforcement learning; video streaming;
D O I
10.1109/ACCESS.2022.3175209
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
HTTP-based video streaming technology is widely used in today's video delivery services. The streaming solution uses the adaptive bitrate (ABR) algorithm for better video quality and user experience. Despite many efforts to improve the quality of experience (QoE), it is very challenging for ABR algorithms to guarantee high QoE to all users in various environments. The video streaming circumstances in the real world have become even more complicated by the proliferation of mobile devices, high-quality content, and heterogeneous configurations of video players. Many ABR algorithms aim to find monotonous strategies that generally perform well without focusing on the complexity of the environments, which can degrade performance. In this paper, we propose ABRaider that guarantees high QoE to all users in a variety of environments in the real world while being generalized with multiple strategies and specialized in each user's environment. In ABRaider, we propose multi-phase RL consisting of offline and online phases. In the offline phase, ABRaider integrates the strengths of the ABR algorithms and develops policies suitable for various environments. In the online phase, ABRaider focuses on specializing in the environments of individual users by leveraging the computational power of the clients. Experiment results show that ABRaider outperforms existing solutions in various environments, achieving 19.9% and 42.2% QoE improvement in VoD and live streaming, respectively.
引用
收藏
页码:53108 / 53123
页数:16
相关论文
共 50 条
  • [31] Live Video Streaming Optimization Based on Deep Reinforcement Learning
    Zhang, Xueshuai
    Hu, Yuxiang
    Li, Ziyong
    ICMLC 2020: 2020 12TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND COMPUTING, 2018, : 116 - 120
  • [32] Online Learning for Adaptive Video Streaming in Mobile Networks
    Karagkioules, Theodoros
    Paschos, Georgios S.
    Liakopoulos, Nikolaos
    Fiandrotti, Attilio
    Tsilimantos, Dimitrios
    Cagnazzo, Marco
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2022, 18 (01)
  • [33] Reinforcement Learning for Adaptive Video Compressive Sensing
    Lu, Sidi
    Yuan, Xin
    Katsaggelos, Aggelos K.
    Shi, Weisong
    ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2023, 14 (05)
  • [34] Adaptive Video Streaming Based on Learning Intrinsic Reward
    Feng, Yining
    Wang, Ying
    Liu, Hongyang
    Cong, Lin
    Liu, Yan
    2022 IEEE INTERNATIONAL SYMPOSIUM ON BROADBAND MULTIMEDIA SYSTEMS AND BROADCASTING (BMSB), 2022,
  • [35] Adaptive Streaming Continuous Learning System for Video Analytics
    Li, Tianyu
    Li, Qing
    Zhang, Mei
    Yuan, Zhenhui
    Jiang, Yong
    2024 IEEE/ACM 32ND INTERNATIONAL SYMPOSIUM ON QUALITY OF SERVICE, IWQOS, 2024,
  • [36] A Q-Learning Solution for Adaptive Video Streaming
    Marinca, Dana
    Barth, Dominique
    De Vleeschauwer, Danny
    2013 INTERNATIONAL CONFERENCE ON SELECTED TOPICS IN MOBILE AND WIRELESS NETWORKING (MOWNET), 2013, : 120 - 126
  • [37] Learning from Having Learned: An Environment-adaptive Parking Space Detection Method
    Yang Yi
    Sitan, Jiang
    Zhang Lu
    Wang Jianhang
    2020 AMERICAN CONTROL CONFERENCE (ACC), 2020, : 4022 - 4027
  • [38] Environment-adaptive Interaction Primitives for Human-Robot Motor Skill Learning
    Cui, Yunduan
    Poon, James
    Matsubara, Takamitsu
    Miro, Jaime Valls
    Sugimoto, Kenji
    Yamazaki, Kimitoshi
    2016 IEEE-RAS 16TH INTERNATIONAL CONFERENCE ON HUMANOID ROBOTS (HUMANOIDS), 2016, : 711 - 717
  • [39] Adaptive Streaming of 360-Degree Videos with Reinforcement Learning
    Park, Sohee
    Hoai, Minh
    Bhattacharya, Arani
    Das, Samir R.
    2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021), 2021, : 1838 - 1847
  • [40] Adaptive Streaming Scheme with Reinforcement Learning in Edge Computing Environments
    Kang, Jeongho
    Chung, Kwangsue
    2023 INTERNATIONAL CONFERENCE ON INFORMATION NETWORKING, ICOIN, 2023, : 128 - 133