Imitation Learning for Adaptive Video Streaming With Future Adversarial Information Bottleneck Principle

被引:0
|
作者
Wang, Shuoyao [1 ]
Lin, Jiawei [1 ]
Ye, Fangwei [2 ]
机构
[1] Shenzhen Univ, Coll Elect & Informat Engn, Shenzhen 518060, Peoples R China
[2] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing 210095, Peoples R China
基金
中国国家自然科学基金;
关键词
Adaptive video streaming; imitation learning; information bottleneck; mixed-integer non-linear programming;
D O I
10.1109/TMC.2024.3437455
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Adaptive video streaming plays a crucial role in ensuring high-quality video streaming services. Despite extensive research efforts devoted to Adaptive BitRate (ABR) techniques, the current reinforcement learning (RL)-based ABR algorithms may benefit the average Quality of Experience (QoE) but suffers from fluctuating performance in individual video sessions. In this paper, we present a novel approach that combines imitation learning with the information bottleneck technique, to learn from the complex offline optimal scenario rather than inefficient exploration. In particular, we leverage the deterministic offline bitrate optimization problem with the future throughput realization as the expert and formulate it as a mixed-integer non-linear programming (MINLP) problem. To enable large-scale training for improved performance, we propose an alternative optimization algorithm that efficiently solves the formulated MINLP problem. To address the overfitting issues due to the future information leakage in MINLP, we incorporate an adversarial information bottleneck framework. By compressing the video streaming state into a latent space, we retain only action-relevant information. Additionally, we introduce a future adversarial term to mitigate the influence of future information leakage, where Model Prediction Control (MPC) policy without any future information is employed as the adverse expert. Experimental results demonstrate the effectiveness of our proposed approach in significantly enhancing the quality of adaptive video streaming, providing a 7.30% average QoE improvement and a 30.01% average ranking reduction.
引用
收藏
页码:13670 / 13683
页数:14
相关论文
共 50 条
  • [41] User-Adaptive Editing for 360° Video Streaming with Deep Reinforcement Learning
    Sassatelli, Lucile
    Winckler, Marco
    Fisichella, Thomas
    Aparicio, Ramon
    PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 2208 - 2210
  • [42] Improving Generalization for Neural Adaptive Video Streaming via Meta Reinforcement Learning
    Kan, Nuowen
    Jiang, Yuankun
    Li, Chenglin
    Dai, Wenrui
    Zou, Junni
    Xiong, Hongkai
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 3006 - 3016
  • [43] Design and Evaluation of a Self-Learning HTTP Adaptive Video Streaming Client
    Claeys, Maxim
    Latre, Steven
    Famaey, Jeroen
    De Turck, Filip
    IEEE COMMUNICATIONS LETTERS, 2014, 18 (04) : 716 - 719
  • [44] SMASH: a Supervised Machine Learning Approach to Adaptive Video Streaming over HTTP
    Sani, Yusuf
    Raca, Darijo
    Quinlan, Jason J.
    Sreenan, Cormac J.
    2020 TWELFTH INTERNATIONAL CONFERENCE ON QUALITY OF MULTIMEDIA EXPERIENCE (QOMEX), 2020,
  • [45] MANSY: Generalizing Neural Adaptive Immersive Video Streaming With Ensemble and Representation Learning
    Wu, Duo
    Wu, Panlong
    Zhang, Miao
    Wang, Fangxin
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2025, 24 (03) : 1654 - 1668
  • [46] Latency Aware Adaptive Video Streaming using Ensemble Deep Reinforcement Learning
    Zhao, Yin
    Shen, Qi-Wei
    Li, Wei
    Xu, Tong
    Niu, Wei-Hua
    Xu, Si-Ran
    PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 2647 - 2651
  • [47] Learning-based approach for layered adaptive video streaming over SDN
    Uzakgider, Tuba
    Cetinkaya, Cihat
    Sayit, Muge
    COMPUTER NETWORKS, 2015, 92 : 357 - 368
  • [48] Adaptive Learning Video Streaming with QoE in Multi-Home Heterogeneous Networks
    Vijayashaarathi S.
    NithyaKalyani S.
    Computer Systems Science and Engineering, 2023, 46 (03): : 2881 - 2897
  • [49] HotDASH: Hotspot Aware Adaptive Video Streaming using Deep Reinforcement Learning
    Sengupta, Satadal
    Ganguly, Niloy
    Chakraborty, Sandip
    De, Pradipta
    2018 IEEE 26TH INTERNATIONAL CONFERENCE ON NETWORK PROTOCOLS (ICNP), 2018, : 165 - 175
  • [50] Peer-To-Peer Live Adaptive Video Streaming for Information Centric Cellular Networks
    Detti, Andrea
    Ricci, Bruno
    Blefari-Melazzi, Nicola
    2013 IEEE 24TH INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR, AND MOBILE RADIO COMMUNICATIONS (PIMRC), 2013, : 3583 - 3588