Imitation Learning for Adaptive Video Streaming With Future Adversarial Information Bottleneck Principle

Authors
Wang, Shuoyao [1]
Lin, Jiawei [1]
Ye, Fangwei [2]
Affiliations
[1] Shenzhen Univ, Coll Elect & Informat Engn, Shenzhen 518060, Peoples R China
[2] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing 210095, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Adaptive video streaming; imitation learning; information bottleneck; mixed-integer non-linear programming;
DOI
10.1109/TMC.2024.3437455
Chinese Library Classification number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
Adaptive video streaming plays a crucial role in ensuring high-quality video streaming services. Despite extensive research efforts devoted to Adaptive BitRate (ABR) techniques, the current reinforcement learning (RL)-based ABR algorithms may benefit the average Quality of Experience (QoE) but suffer from fluctuating performance in individual video sessions. In this paper, we present a novel approach that combines imitation learning with the information bottleneck technique to learn from the complex offline optimal scenario rather than from inefficient exploration. In particular, we leverage the deterministic offline bitrate optimization problem with the future throughput realization as the expert and formulate it as a mixed-integer non-linear programming (MINLP) problem. To enable large-scale training for improved performance, we propose an alternative optimization algorithm that efficiently solves the formulated MINLP problem. To address the overfitting issues due to the future information leakage in MINLP, we incorporate an adversarial information bottleneck framework. By compressing the video streaming state into a latent space, we retain only action-relevant information. Additionally, we introduce a future adversarial term to mitigate the influence of future information leakage, where a Model Predictive Control (MPC) policy without any future information is employed as the adverse expert. Experimental results demonstrate the effectiveness of our proposed approach in significantly enhancing the quality of adaptive video streaming, providing a 7.30% average QoE improvement and a 30.01% average ranking reduction.
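The idea of an offline expert that sees the future throughput realization can be illustrated with a toy sketch. The following is not the paper's MINLP formulation or its solver: it is a brute-force search over bitrate plans under a simplified QoE (bitrate reward minus rebuffering and switching penalties). The function names, the penalty weights, the 4-second chunk length, and the one-throughput-sample-per-chunk model are all assumptions made for illustration.

```python
from itertools import product


def simulate_qoe(plan, bitrates_kbps, throughput_kbps, chunk_sec=4.0,
                 rebuf_penalty=4.3, smooth_penalty=1.0, start_buffer_sec=0.0):
    """Total QoE of a bitrate plan when the whole throughput trace is known.

    Assumption: throughput_kbps[i] is constant while chunk i downloads.
    The penalty weights are illustrative, not taken from the paper.
    """
    buffer_sec = start_buffer_sec
    qoe = 0.0
    prev_rate = None
    for level, tput in zip(plan, throughput_kbps):
        rate = bitrates_kbps[level]
        download_sec = rate * chunk_sec / tput
        # Playback stalls for whatever the buffer cannot cover.
        rebuf_sec = max(0.0, download_sec - buffer_sec)
        buffer_sec = max(0.0, buffer_sec - download_sec) + chunk_sec
        qoe += rate / 1000.0 - rebuf_penalty * rebuf_sec
        if prev_rate is not None:
            qoe -= smooth_penalty * abs(rate - prev_rate) / 1000.0
        prev_rate = rate
    return qoe


def offline_expert(bitrates_kbps, throughput_kbps, **kwargs):
    """Exhaustively search all plans; feasible only for very short horizons."""
    levels = range(len(bitrates_kbps))
    best = max(
        product(levels, repeat=len(throughput_kbps)),
        key=lambda p: simulate_qoe(p, bitrates_kbps, throughput_kbps, **kwargs),
    )
    return list(best), simulate_qoe(best, bitrates_kbps, throughput_kbps, **kwargs)


if __name__ == "__main__":
    ladder = [300, 750, 1200]          # hypothetical bitrate ladder (kbps)
    trace = [900, 400, 1500, 2000]     # hypothetical future throughput (kbps)
    plan, score = offline_expert(ladder, trace, start_buffer_sec=4.0)
    print(plan, round(score, 3))
```

The search space grows exponentially with the number of chunks, which is one reason an efficient solver matters when expert labels are needed at large scale. In an imitation-learning setup, the plans returned by such an expert would serve as action labels for a policy that only observes past information.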
Pages: 13670-13683
Number of pages: 14