Attention-Based Deep Reinforcement Learning for Virtual Cinematography of 360° Videos

被引：5

作者：

Wang, Jianyi ^{[1
]}

Xu, Mai ^{[1
]}

Jiang, Lai ^{[1
]}

Song, Yuhang ^{[2
]}

机构：

[1] Beihang Univ, Sch Elect & Informat Engn, Beijing 100191, Peoples R China

[2] Univ Oxford, Somerville Coll, Dept Comp Sci, Oxford OX2 6HD, England

来源：

IEEE TRANSACTIONS ON MULTIMEDIA | 2021年 / 23卷

基金：

北京市自然科学基金;

关键词：

360 degrees video; attention; deep reinforcement learning; SALIENCY PREDICTION; MODEL; IMAGES; HEAD; EYE; 2D;

D O I：

10.1109/TMM.2020.3021984

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Virtual cinematography refers to automatically selecting a natural-looking normal field-of-view (NFOV) from an entire 360 degrees video. In fact, virtual cinematography can be modeled as a deep reinforcement learning (DRL) problem, in which an agent makes actions related to NFOV selection according to the environment of 360 degrees video frames. More importantly, we find from our data analysis that the selected NFOVs attract significantly more attention than other regions, i.e., the NFOVs have high saliency. Therefore, in this paper, we propose an attention based DRL (A-DRL) approach for virtual cinematography in 360 degrees video. Specifically, we develop a new DRL framework for automatic NFOV selection with the input of both the content, and saliency map of each 360 degrees frame. Then, we propose a new reward function for the DRL framework in our approach, which considers the saliency values, ground-truth, and smooth transition for NFOV selection. Subsequently, a simplified DenseNet (called Mini-DenseNet) is designed to learn the optimal policy via maximizing the reward. Based on the learned policy, the actions of NFOV can be made in our A-DRL approach for virtual cinematography of 360 degrees video. Extensive experiments show that our A-DRL approach outperforms other state-of-the-art virtual cinematography methods, over the datasets of Sports-360 video, and Pano2Vid.

引用

页码：3227 / 3238

页数：12

共 50 条

[1] Saliency Computation for Virtual Cinematography in 360° Videos
Du, Ruofei
Varshney, Amitabh
[J]. IEEE COMPUTER GRAPHICS AND APPLICATIONS, 2021, 41 (04) : 99 - 106
[2] Attention-Based Deep Reinforcement Learning for Edge User Allocation
Chang, Jiaxin
Wang, Jian
Li, Bing
Zhao, Yuqi
Li, Duantengchuan
[J]. IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2024, 21 (01): : 590 - 604
[3] Learning to Drive at Unsignalized Intersections using Attention-based Deep Reinforcement Learning
Seong, Hyunki
Jung, Chanyoung
Lee, Seungwook
Shim, David Hyunchul
[J]. 2021 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2021, : 559 - 566
[4] Attention-based Deep Reinforcement Learning for Multi-view Environments
Barati, Elaheh
Chen, Xuewen
Zhong, Zichun
[J]. AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 1805 - 1807
[5] ATTENTION-BASED CURIOSITY-DRIVEN EXPLORATION IN DEEP REINFORCEMENT LEARNING
Reizinger, Patrik
Szemenyei, Marton
[J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 3542 - 3546
[6] Attention-based model and deep reinforcement learning for distribution of event processing tasks
Mazayev, Andriy
Al-Tam, Faroq
Correia, Noelia
[J]. INTERNET OF THINGS, 2022, 19
[7] ADRL: An attention-based deep reinforcement learning framework for knowledge graph reasoning
Wang, Qi
Hao, Yongsheng
Cao, Jie
[J]. KNOWLEDGE-BASED SYSTEMS, 2020, 197
[8] ARiADNE: A Reinforcement learning approach using Attention-based Deep Networks for Exploration
Cao, Yuhong
Hou, Tianxiang
Wang, Yizhuo
Yi, Xian
Sartoretti, Guillaume
[J]. 2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023), 2023, : 10219 - 10225
[9] Attention-Based Mechanisms for Cognitive Reinforcement Learning
Gao, Yue
Li, Di
Chen, Xiangjian
Zhu, Junwu
[J]. APPLIED SCIENCES-BASEL, 2023, 13 (13):
[10] Attention-based Open RAN Slice Management using Deep Reinforcement Learning
Lotfi, Fatemeh
Afghah, Fatemeh
Ashdown, Jonathan
[J]. IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 6328 - 6333

← 1 2 3 4 5 →