Viewport Proposal CNN for 360° Video Quality Assessment

被引:49
|
作者
Li, Chen [1 ]
Xu, Mai [1 ,2 ]
Jiang, Lai [1 ]
Zhang, Shanyi [1 ]
Tao, Xiaoming [3 ]
机构
[1] Beihang Univ, Sch Elect & Informat Engn, Beijing, Peoples R China
[2] Beihang Univ, Hangzhou Innovat Inst HZII, Hangzhou, Zhejiang, Peoples R China
[3] Tsinghua Univ, Dept Elect Engn, Beijing, Peoples R China
关键词
PREDICTION; SALIENCY;
D O I
10.1109/CVPR.2019.01042
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent years have witnessed the growing interest in visual quality assessment (VQA) for 360 degrees video. Unfortunately, the existing VQA approaches do not consider the facts that: 1) Observers only see viewports of 360 degrees video, rather than patches or whole 360 degrees frames. 2) Within the viewport, only salient regions can be perceived by observers with high resolution. Thus, this paper proposes a viewport-based convolutional neural network (V-CNN) approach for VQA on 360 degrees video, considering both auxiliary tasks of viewport proposal and viewport saliency prediction. Our V-CNN approach is composed of two stages, i.e., viewport proposal and VQA. In the first stage, the viewport proposal network (VP-net) is developed to yield several potential viewports, seen as the first auxiliary task. In the second stage, a viewport quality network (VQ-net) is designed to rate the VQA score for each proposed viewport, in which the saliency map of the viewport is predicted and then utilized in VQA score rating. Consequently, another auxiliary task of viewport saliency prediction can be achieved. More importantly, the main task of VQA on 360 degrees video can be accomplished via integrating the VQA scores of all view ports. The experiments validate the effectiveness of our V-CNN approach in significantly advancing the state-of-the-art performance of VQA on 360 degrees video. In addition, our approach achieves comparable performance in two auxiliary tasks. The code of our V-CNN approach is available at https://github.com/Archer-Tatsu/V-CNN.
引用
收藏
页码:10169 / 10178
页数:10
相关论文
共 50 条
  • [1] Viewport-Based CNN: A Multi-Task Approach for Assessing 360° Video Quality
    Xu, Mai
    Jiang, Lai
    Li, Chen
    Wang, Zulin
    Tao, Xiaoming
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (04) : 2198 - 2215
  • [2] Proposal With Alignment: A Bi-Directional Transformer for 360° Video Viewport Proposal
    Guo, Yichen
    Xu, Mai
    Jiang, Lai
    Deng, Xin
    Zhou, Jing
    Chen, Gaoxing
    Sigal, Leonid
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 34 (11) : 11423 - 11437
  • [3] 360° video quality assessment based on saliency-guided viewport extraction
    Yang, Fanxi
    Yang, Chao
    An, Ping
    Huang, Xinpeng
    MULTIMEDIA SYSTEMS, 2024, 30 (02)
  • [4] 360° video quality assessment based on saliency-guided viewport extraction
    Fanxi Yang
    Chao Yang
    Ping An
    Xinpeng Huang
    Multimedia Systems, 2024, 30
  • [5] Transitions of Viewport Quality Adaptation Mechanisms in 360 Degree Video Streaming
    Koch, Christian
    Rak, Arne-Tobias
    Zink, Michael
    Steinmetz, Ralf
    Rizk, Amr
    PROCEEDINGS OF THE 29TH ACM WORKSHOP ON NETWORK AND OPERATING SYSTEMS SUPPORT FOR DIGITAL AUDIO AND VIDEO (NOSSDAV'19), 2019, : 14 - 19
  • [6] Stable Viewport-Based Unsupervised Compressed 360° Video Quality Enhancement
    Zou, Zizhuang
    Ye, Mao
    Li, Xue
    Ji, Luping
    Zhu, Ce
    IEEE TRANSACTIONS ON BROADCASTING, 2024, 70 (02) : 607 - 619
  • [7] A VIEWPORT-DRIVEN MULTI-METRIC FUSION APPROACH FOR 360-DEGREE VIDEO QUALITY ASSESSMENT
    Azevedo, Roberto G. de A.
    Birkbeck, Neil
    Janatra, Ivan
    Adsumilli, Balu
    Frossard, Pascal
    2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,
  • [8] Smooth Viewport Bitrate Adaptation for 360 Video Streaming
    Hoang Le Dieu Huong
    Nguyen, Duc, V
    Truong Thu Huong
    Pham Ngoc Nam
    Truong Cong Thang
    PROCEEDINGS OF 2019 6TH NATIONAL FOUNDATION FOR SCIENCE AND TECHNOLOGY DEVELOPMENT (NAFOSTED) CONFERENCE ON INFORMATION AND COMPUTER SCIENCE (NICS), 2019, : 512 - 517
  • [9] Viewport-Dependent Saliency Prediction in 360° Video
    Qiao, Minglang
    Xu, Mai
    Wang, Zulin
    Borji, Ali
    IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 23 : 748 - 760
  • [10] Overview of 360-Degree Video and Viewport Prediction
    Li, Zhenhuai
    Zhan, Yinwei
    Computer Engineering and Applications, 2024, 60 (02)