Imitation Learning for Adaptive Video Streaming With Future Adversarial Information Bottleneck Principle

被引:0
|
作者
Wang, Shuoyao [1 ]
Lin, Jiawei [1 ]
Ye, Fangwei [2 ]
机构
[1] Shenzhen Univ, Coll Elect & Informat Engn, Shenzhen 518060, Peoples R China
[2] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing 210095, Peoples R China
基金
中国国家自然科学基金;
关键词
Adaptive video streaming; imitation learning; information bottleneck; mixed-integer non-linear programming;
D O I
10.1109/TMC.2024.3437455
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Adaptive video streaming plays a crucial role in ensuring high-quality video streaming services. Despite extensive research efforts devoted to Adaptive BitRate (ABR) techniques, the current reinforcement learning (RL)-based ABR algorithms may benefit the average Quality of Experience (QoE) but suffers from fluctuating performance in individual video sessions. In this paper, we present a novel approach that combines imitation learning with the information bottleneck technique, to learn from the complex offline optimal scenario rather than inefficient exploration. In particular, we leverage the deterministic offline bitrate optimization problem with the future throughput realization as the expert and formulate it as a mixed-integer non-linear programming (MINLP) problem. To enable large-scale training for improved performance, we propose an alternative optimization algorithm that efficiently solves the formulated MINLP problem. To address the overfitting issues due to the future information leakage in MINLP, we incorporate an adversarial information bottleneck framework. By compressing the video streaming state into a latent space, we retain only action-relevant information. Additionally, we introduce a future adversarial term to mitigate the influence of future information leakage, where Model Prediction Control (MPC) policy without any future information is employed as the adverse expert. Experimental results demonstrate the effectiveness of our proposed approach in significantly enhancing the quality of adaptive video streaming, providing a 7.30% average QoE improvement and a 30.01% average ranking reduction.
引用
收藏
页码:13670 / 13683
页数:14
相关论文
共 50 条
  • [1] Quality-Aware Neural Adaptive Video Streaming With Lifelong Imitation Learning
    Huang, Tianchi
    Zhou, Chao
    Yao, Xin
    Zhang, Rui-Xiao
    Wu, Chenglei
    Yu, Bing
    Sun, Lifeng
    IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2020, 38 (10) : 2324 - 2342
  • [2] Comyco: Quality-Aware Adaptive Video Streaming via Imitation Learning
    Huang, Tianchi
    Zhou, Chao
    Zhang, Rui-Xiao
    Wu, Chenglei
    Yao, Xin
    Sun, Lifeng
    PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 429 - 437
  • [3] Deep Learning and the Information Bottleneck Principle
    Tishby, Naftali
    Zaslavsky, Noga
    2015 IEEE INFORMATION THEORY WORKSHOP (ITW), 2015,
  • [4] InfoAT: Improving Adversarial Training Using the Information Bottleneck Principle
    Xu, Mengting
    Zhang, Tao
    Li, Zhongnian
    Zhang, Daoqiang
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (01) : 1255 - 1264
  • [5] QoE-Driven Dynamic Adaptive Video Streaming Strategy With Future Information
    Yu, Li
    Tillo, Tammam
    Xiao, Jimin
    IEEE TRANSACTIONS ON BROADCASTING, 2017, 63 (03) : 523 - 534
  • [6] Adversarial Imitation Learning from Video using a State Observer
    Karnan, Haresh
    Torabi, Faraz
    Warnell, Garrett
    Stone, Peter
    2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2022), 2022, : 2452 - 2458
  • [7] Applying the Information Bottleneck Principle to Prosodic Representation Learning
    Zhang, Guangyan
    Qin, Ying
    Tan, Daxin
    Lee, Tan
    INTERSPEECH 2021, 2021, : 3156 - 3160
  • [8] Addressing implicit bias in adversarial imitation learning with mutual information
    Zhang, Lihua
    Liu, Quan
    Zhu, Fei
    Huang, Zhigang
    NEURAL NETWORKS, 2023, 167 : 847 - 864
  • [9] Adaptive Information Gathering via Imitation Learning
    Choudhury, Sanjiban
    Kapoor, Ashish
    Ranade, Gireeja
    Scherer, Sebastian
    Dey, Debadeepta
    ROBOTICS: SCIENCE AND SYSTEMS XIII, 2017,
  • [10] Learning Fair Representations for Recommendation via Information Bottleneck Principle
    Xie, Junsong
    Yang, Yonghui
    Wang, Zihan
    Wu, Le
    PROCEEDINGS OF THE THIRTY-THIRD INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2024, 2024, : 2469 - 2477