Imitation Learning for Adaptive Video Streaming With Future Adversarial Information Bottleneck Principle

Authors
Wang, Shuoyao [1]
Lin, Jiawei [1]
Ye, Fangwei [2]
Affiliations
[1] Shenzhen Univ, Coll Elect & Informat Engn, Shenzhen 518060, Peoples R China
[2] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing 210095, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Adaptive video streaming; imitation learning; information bottleneck; mixed-integer non-linear programming
DOI
10.1109/TMC.2024.3437455
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Adaptive video streaming plays a crucial role in ensuring high-quality video streaming services. Despite extensive research on Adaptive BitRate (ABR) techniques, current reinforcement learning (RL)-based ABR algorithms may improve the average Quality of Experience (QoE) but suffer from fluctuating performance in individual video sessions. In this paper, we present a novel approach that combines imitation learning with the information bottleneck technique, learning from the complex offline optimal solution rather than through inefficient exploration. In particular, we use the deterministic offline bitrate optimization problem, in which the future throughput realization is known, as the expert, and formulate it as a mixed-integer non-linear programming (MINLP) problem. To enable large-scale training for improved performance, we propose an alternative optimization algorithm that efficiently solves the formulated MINLP problem. To address the overfitting caused by future information leakage in the MINLP expert, we incorporate an adversarial information bottleneck framework. By compressing the video streaming state into a latent space, we retain only action-relevant information. Additionally, we introduce a future adversarial term to mitigate the influence of future information leakage, where a Model Predictive Control (MPC) policy without any future information is employed as the adverse expert. Experimental results demonstrate that the proposed approach significantly enhances the quality of adaptive video streaming, providing a 7.30% average QoE improvement and a 30.01% average ranking reduction.
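The idea of an offline expert that sees the future throughput can be illustrated with a small sketch. This is not the paper's MINLP formulation or its solver: it replaces the optimization with exhaustive search over a tiny bitrate ladder, and the QoE form (bitrate reward minus rebuffering and smoothness penalties), the function names, and all parameter values are assumptions chosen for illustration.

```python
from itertools import product


def simulate_qoe(bitrates_kbps, throughput_kbps, chunk_seconds=4.0,
                 start_buffer=0.0, rebuf_penalty=4.3, smooth_penalty=1.0):
    """QoE of a fixed bitrate plan under a known per-chunk throughput trace.

    Assumed QoE (illustrative, not taken from the paper):
    sum of bitrate in Mbps - rebuf_penalty * stall seconds
    - smooth_penalty * |bitrate change| in Mbps.
    """
    buffer_s = start_buffer
    qoe = 0.0
    prev = None
    for rate, tput in zip(bitrates_kbps, throughput_kbps):
        download_s = rate * chunk_seconds / tput
        # Playback stalls whenever the download outlasts the buffered video.
        rebuffer_s = max(download_s - buffer_s, 0.0)
        buffer_s = max(buffer_s - download_s, 0.0) + chunk_seconds
        qoe += rate / 1000.0 - rebuf_penalty * rebuffer_s
        if prev is not None:
            qoe -= smooth_penalty * abs(rate - prev) / 1000.0
        prev = rate
    return qoe


def offline_expert(ladder_kbps, throughput_kbps, **kwargs):
    """Brute-force the best plan when the whole throughput trace is known."""
    best_plan, best_qoe = None, float("-inf")
    for plan in product(ladder_kbps, repeat=len(throughput_kbps)):
        q = simulate_qoe(plan, throughput_kbps, **kwargs)
        if q > best_qoe:
            best_plan, best_qoe = plan, q
    return best_plan, best_qoe


if __name__ == "__main__":
    ladder = [300, 750, 1200]          # hypothetical bitrate ladder (kbps)
    trace = [900, 400, 2500, 1500]     # hypothetical future throughput (kbps)
    print(offline_expert(ladder, trace, start_buffer=4.0))
```

Exhaustive search grows exponentially with the number of chunks, which is one reason an efficient solver matters when expert labels are generated at scale. A policy trained to imitate such labels can pick up on the future throughput it will not have at deployment, which is the leakage the abstract's adversarial bottleneck term is meant to counter.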
Pages: 13670-13683
Page count: 14