Attention-Based Deep Reinforcement Learning for Virtual Cinematography of 360° Videos

被引:5
|
作者
Wang, Jianyi [1 ]
Xu, Mai [1 ]
Jiang, Lai [1 ]
Song, Yuhang [2 ]
机构
[1] Beihang Univ, Sch Elect & Informat Engn, Beijing 100191, Peoples R China
[2] Univ Oxford, Somerville Coll, Dept Comp Sci, Oxford OX2 6HD, England
基金
北京市自然科学基金;
关键词
360 degrees video; attention; deep reinforcement learning; SALIENCY PREDICTION; MODEL; IMAGES; HEAD; EYE; 2D;
D O I
10.1109/TMM.2020.3021984
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Virtual cinematography refers to automatically selecting a natural-looking normal field-of-view (NFOV) from an entire 360 degrees video. In fact, virtual cinematography can be modeled as a deep reinforcement learning (DRL) problem, in which an agent makes actions related to NFOV selection according to the environment of 360 degrees video frames. More importantly, we find from our data analysis that the selected NFOVs attract significantly more attention than other regions, i.e., the NFOVs have high saliency. Therefore, in this paper, we propose an attention based DRL (A-DRL) approach for virtual cinematography in 360 degrees video. Specifically, we develop a new DRL framework for automatic NFOV selection with the input of both the content, and saliency map of each 360 degrees frame. Then, we propose a new reward function for the DRL framework in our approach, which considers the saliency values, ground-truth, and smooth transition for NFOV selection. Subsequently, a simplified DenseNet (called Mini-DenseNet) is designed to learn the optimal policy via maximizing the reward. Based on the learned policy, the actions of NFOV can be made in our A-DRL approach for virtual cinematography of 360 degrees video. Extensive experiments show that our A-DRL approach outperforms other state-of-the-art virtual cinematography methods, over the datasets of Sports-360 video, and Pano2Vid.
引用
收藏
页码:3227 / 3238
页数:12
相关论文
共 50 条
  • [1] Saliency Computation for Virtual Cinematography in 360° Videos
    Du, Ruofei
    Varshney, Amitabh
    [J]. IEEE COMPUTER GRAPHICS AND APPLICATIONS, 2021, 41 (04) : 99 - 106
  • [2] Attention-Based Deep Reinforcement Learning for Edge User Allocation
    Chang, Jiaxin
    Wang, Jian
    Li, Bing
    Zhao, Yuqi
    Li, Duantengchuan
    [J]. IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2024, 21 (01): : 590 - 604
  • [3] Learning to Drive at Unsignalized Intersections using Attention-based Deep Reinforcement Learning
    Seong, Hyunki
    Jung, Chanyoung
    Lee, Seungwook
    Shim, David Hyunchul
    [J]. 2021 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2021, : 559 - 566
  • [4] Attention-based Deep Reinforcement Learning for Multi-view Environments
    Barati, Elaheh
    Chen, Xuewen
    Zhong, Zichun
    [J]. AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 1805 - 1807
  • [5] ATTENTION-BASED CURIOSITY-DRIVEN EXPLORATION IN DEEP REINFORCEMENT LEARNING
    Reizinger, Patrik
    Szemenyei, Marton
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 3542 - 3546
  • [6] Attention-based model and deep reinforcement learning for distribution of event processing tasks
    Mazayev, Andriy
    Al-Tam, Faroq
    Correia, Noelia
    [J]. INTERNET OF THINGS, 2022, 19
  • [7] ADRL: An attention-based deep reinforcement learning framework for knowledge graph reasoning
    Wang, Qi
    Hao, Yongsheng
    Cao, Jie
    [J]. KNOWLEDGE-BASED SYSTEMS, 2020, 197
  • [8] ARiADNE: A Reinforcement learning approach using Attention-based Deep Networks for Exploration
    Cao, Yuhong
    Hou, Tianxiang
    Wang, Yizhuo
    Yi, Xian
    Sartoretti, Guillaume
    [J]. 2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023), 2023, : 10219 - 10225
  • [9] Attention-Based Mechanisms for Cognitive Reinforcement Learning
    Gao, Yue
    Li, Di
    Chen, Xiangjian
    Zhu, Junwu
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (13):
  • [10] Attention-based Open RAN Slice Management using Deep Reinforcement Learning
    Lotfi, Fatemeh
    Afghah, Fatemeh
    Ashdown, Jonathan
    [J]. IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 6328 - 6333