A Two-Stage Deep Reinforcement Learning Framework for MEC-Enabled Adaptive 360-Degree Video Streaming

被引：0

作者：

Bi, Suzhi ^{[1
]}

Chen, Haoguo ^{[1
]}

Li, Xian ^{[1
]}

Wang, Shuoyao ^{[1
]}

Wu, Yuan ^{[2
]}

Qian, Liping ^{[3
]}

机构：

[1] Shenzhen Univ, Coll Elect & Informat Engn, State Key Lab Radio Frequency Heterogeneous Integ, Shenzhen 518060, Peoples R China

[2] Univ Macau, Dept Comp & Informat Sci, State Key Lab Internet Things Smart City, Taipa, Macao, Peoples R China

[3] Zhejiang Univ Technol, Coll Informat Engn, Hangzhou, Zhejiang, Peoples R China

来源：

IEEE TRANSACTIONS ON MOBILE COMPUTING | 2024年 / 23卷 / 12期

关键词：

Streaming media; Bit rate; Quality of experience; Wireless communication; Real-time systems; Resists; Accuracy; Adaptive streaming; multi-access edge computing (MEC); quality of experience (QoE); deep reinforcement learning; RATE ADAPTATION; PREDICTION; COMMUNICATION;

D O I：

10.1109/TMC.2024.3443200

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The emerging multi-access edge computing (MEC) technology effectively enhances the wireless streaming performance of 360-degree videos. By connecting a user's head-mounted device (HMD) to a smart MEC platform, the edge server (ES) can efficiently perform adaptive tile-based video streaming to improve the user's viewing experience. Under constrained wireless channel capacity, the ES can predict the user's field of view (FoV) and transmit to the HMD high-resolution video tiles only within the predicted FoV. In practice, the video streaming performance is challenged by the random FoV prediction error and wireless channel fading effects. For this, we propose in this paper a novel two-stage adaptive 360-degree video streaming scheme that maximizes the user's quality of experience (QoE) to attain stable and high-resolution video playback. Specifically, we divide the video file into groups of pictures (GOPs) of fixed playback interval, where each GOP consists of a number of video frames. At the beginning of each GOP (i.e., the inter-GOP stage), the ES predicts the FoV of the next GOP and allocates an encoding bitrate for transmitting (precaching) the video tiles within the predicted FoV. Then, during the real-time video playback of the current GOP (i.e., the intra-GOP stage), the ES observes the user's true FoV of each frame and transmits the missing tiles to compensate for the FoV prediction errors. To maximize the user's QoE under random variations of FoV and wireless channel, we propose a double-agent deep reinforcement learning framework, where the two agents operate in different time scales to decide the bitrates of inter- and intra-GOP stages, respectively. Experiments based on real-world measurements show that the proposed scheme can effectively mitigate FoV prediction errors and maintain stable QoE performance under different scenarios, achieving over 22.1% higher QoE than some representative benchmark methods.

引用

页码：14313 / 14329

页数：17

共 50 条

[21] ATOM: Adaptive Task Offloading With Two-Stage Hybrid Matching in MEC-Enabled Industrial IoT
Chi, Jiancheng
Qiu, Tie
Xiao, Fu
Zhou, Xiaobo
IEEE TRANSACTIONS ON MOBILE COMPUTING, 2024, 23 (05) : 4861 - 4877
[22] Online Bitrate Selection for Viewport Adaptive 360-Degree Video Streaming
Tang, Ming
Wong, Vincent W. S.
IEEE TRANSACTIONS ON MOBILE COMPUTING, 2022, 21 (07) : 2506 - 2517
[23] Tile Rate Allocation for 360-Degree Tiled Adaptive Video Streaming
Yadav, Praveen Kumar
Ooi, Wei Tsang
MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 3724 - 3733
[24] User-Adaptive Editing for 360° Video Streaming with Deep Reinforcement Learning
Sassatelli, Lucile
Winckler, Marco
Fisichella, Thomas
Aparicio, Ramon
PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 2208 - 2210
[25] Personalized 360-Degree Video Streaming: A Meta-Learning Approach
Lu, Yiyun
Zhu, Yifei
Wang, Zhi
PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, : 3143 - 3151
[26] JOINT REINFORCEMENT LEARNING AND GAME THEORY BITRATE CONTROL METHOD FOR 360-DEGREE DYNAMIC ADAPTIVE STREAMING
Wei, Xuekai
Zhou, Mingliang
Kwong, Sam
Yuan, Hui
Xiang, Tao
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 4230 - 4234
[27] A Deep Reinforcement Learning Approach for Service Migration in MEC-enabled Vehicular Networks
Abouaomar, Amine
Mlika, Zoubeir
Filali, Abderrahime
Cherkaoui, Soumaya
Kobbane, Abdellatif
PROCEEDINGS OF THE IEEE 46TH CONFERENCE ON LOCAL COMPUTER NETWORKS (LCN 2021), 2021, : 273 - 280
[28] Resource Allocation in MEC-enabled Vehicular Networks: A Deep Reinforcement Learning Approach
Tan, Guoping
Zhang, Huipeng
Zhou, Siyuan
IEEE INFOCOM 2020 - IEEE CONFERENCE ON COMPUTER COMMUNICATIONS WORKSHOPS (INFOCOM WKSHPS), 2020, : 406 - 411
[29] Plato: Learning-based Adaptive Streaming of 360-Degree Videos
Jiang, Xiaolan
Chiang, Yi-Han
Zhao, Yang
Ji, Yusheng
PROCEEDINGS OF THE 2018 IEEE 43RD CONFERENCE ON LOCAL COMPUTER NETWORKS (LCN), 2018, : 393 - 400
[30] 360-DEGREE IMAGE COMPLETION BY TWO-STAGE CONDITIONAL GANS
Akimoto, Naofumi
Kasai, Seito
Hayashi, Masaki
Aoki, Yoshimitsu
2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 4704 - 4708

← 1 2 3 4 5 →