A Two-Stage Deep Reinforcement Learning Framework for MEC-Enabled Adaptive 360-Degree Video Streaming

被引:0
|
作者
Bi, Suzhi [1 ]
Chen, Haoguo [1 ]
Li, Xian [1 ]
Wang, Shuoyao [1 ]
Wu, Yuan [2 ]
Qian, Liping [3 ]
机构
[1] Shenzhen Univ, Coll Elect & Informat Engn, State Key Lab Radio Frequency Heterogeneous Integ, Shenzhen 518060, Peoples R China
[2] Univ Macau, Dept Comp & Informat Sci, State Key Lab Internet Things Smart City, Taipa, Macao, Peoples R China
[3] Zhejiang Univ Technol, Coll Informat Engn, Hangzhou, Zhejiang, Peoples R China
关键词
Streaming media; Bit rate; Quality of experience; Wireless communication; Real-time systems; Resists; Accuracy; Adaptive streaming; multi-access edge computing (MEC); quality of experience (QoE); deep reinforcement learning; RATE ADAPTATION; PREDICTION; COMMUNICATION;
D O I
10.1109/TMC.2024.3443200
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The emerging multi-access edge computing (MEC) technology effectively enhances the wireless streaming performance of 360-degree videos. By connecting a user's head-mounted device (HMD) to a smart MEC platform, the edge server (ES) can efficiently perform adaptive tile-based video streaming to improve the user's viewing experience. Under constrained wireless channel capacity, the ES can predict the user's field of view (FoV) and transmit to the HMD high-resolution video tiles only within the predicted FoV. In practice, the video streaming performance is challenged by the random FoV prediction error and wireless channel fading effects. For this, we propose in this paper a novel two-stage adaptive 360-degree video streaming scheme that maximizes the user's quality of experience (QoE) to attain stable and high-resolution video playback. Specifically, we divide the video file into groups of pictures (GOPs) of fixed playback interval, where each GOP consists of a number of video frames. At the beginning of each GOP (i.e., the inter-GOP stage), the ES predicts the FoV of the next GOP and allocates an encoding bitrate for transmitting (precaching) the video tiles within the predicted FoV. Then, during the real-time video playback of the current GOP (i.e., the intra-GOP stage), the ES observes the user's true FoV of each frame and transmits the missing tiles to compensate for the FoV prediction errors. To maximize the user's QoE under random variations of FoV and wireless channel, we propose a double-agent deep reinforcement learning framework, where the two agents operate in different time scales to decide the bitrates of inter- and intra-GOP stages, respectively. Experiments based on real-world measurements show that the proposed scheme can effectively mitigate FoV prediction errors and maintain stable QoE performance under different scenarios, achieving over 22.1% higher QoE than some representative benchmark methods.
引用
收藏
页码:14313 / 14329
页数:17
相关论文
共 50 条
  • [21] ATOM: Adaptive Task Offloading With Two-Stage Hybrid Matching in MEC-Enabled Industrial IoT
    Chi, Jiancheng
    Qiu, Tie
    Xiao, Fu
    Zhou, Xiaobo
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2024, 23 (05) : 4861 - 4877
  • [22] Online Bitrate Selection for Viewport Adaptive 360-Degree Video Streaming
    Tang, Ming
    Wong, Vincent W. S.
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2022, 21 (07) : 2506 - 2517
  • [23] Tile Rate Allocation for 360-Degree Tiled Adaptive Video Streaming
    Yadav, Praveen Kumar
    Ooi, Wei Tsang
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 3724 - 3733
  • [24] User-Adaptive Editing for 360° Video Streaming with Deep Reinforcement Learning
    Sassatelli, Lucile
    Winckler, Marco
    Fisichella, Thomas
    Aparicio, Ramon
    PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 2208 - 2210
  • [25] Personalized 360-Degree Video Streaming: A Meta-Learning Approach
    Lu, Yiyun
    Zhu, Yifei
    Wang, Zhi
    PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 3143 - 3151
  • [26] JOINT REINFORCEMENT LEARNING AND GAME THEORY BITRATE CONTROL METHOD FOR 360-DEGREE DYNAMIC ADAPTIVE STREAMING
    Wei, Xuekai
    Zhou, Mingliang
    Kwong, Sam
    Yuan, Hui
    Xiang, Tao
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 4230 - 4234
  • [27] A Deep Reinforcement Learning Approach for Service Migration in MEC-enabled Vehicular Networks
    Abouaomar, Amine
    Mlika, Zoubeir
    Filali, Abderrahime
    Cherkaoui, Soumaya
    Kobbane, Abdellatif
    PROCEEDINGS OF THE IEEE 46TH CONFERENCE ON LOCAL COMPUTER NETWORKS (LCN 2021), 2021, : 273 - 280
  • [28] Resource Allocation in MEC-enabled Vehicular Networks: A Deep Reinforcement Learning Approach
    Tan, Guoping
    Zhang, Huipeng
    Zhou, Siyuan
    IEEE INFOCOM 2020 - IEEE CONFERENCE ON COMPUTER COMMUNICATIONS WORKSHOPS (INFOCOM WKSHPS), 2020, : 406 - 411
  • [29] Plato: Learning-based Adaptive Streaming of 360-Degree Videos
    Jiang, Xiaolan
    Chiang, Yi-Han
    Zhao, Yang
    Ji, Yusheng
    PROCEEDINGS OF THE 2018 IEEE 43RD CONFERENCE ON LOCAL COMPUTER NETWORKS (LCN), 2018, : 393 - 400
  • [30] 360-DEGREE IMAGE COMPLETION BY TWO-STAGE CONDITIONAL GANS
    Akimoto, Naofumi
    Kasai, Seito
    Hayashi, Masaki
    Aoki, Yoshimitsu
    2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 4704 - 4708