Viewport Proposal CNN for 360° Video Quality Assessment

Cited by: 49
Authors
Li, Chen [1]
Xu, Mai [1, 2]
Jiang, Lai [1]
Zhang, Shanyi [1]
Tao, Xiaoming [3]
Affiliations
[1] Beihang Univ, Sch Elect & Informat Engn, Beijing, Peoples R China
[2] Beihang Univ, Hangzhou Innovat Inst HZII, Hangzhou, Zhejiang, Peoples R China
[3] Tsinghua Univ, Dept Elect Engn, Beijing, Peoples R China
Keywords
PREDICTION; SALIENCY;
DOI
10.1109/CVPR.2019.01042
Chinese Library Classification number
TP18 [Artificial Intelligence Theory];
Subject classification codes
081104; 0812; 0835; 1405;
Abstract
Recent years have witnessed growing interest in visual quality assessment (VQA) for 360° video. Unfortunately, existing VQA approaches do not consider two facts: 1) Observers see only viewports of 360° video, rather than patches or whole 360° frames. 2) Within a viewport, only salient regions are perceived by observers at high resolution. Thus, this paper proposes a viewport-based convolutional neural network (V-CNN) approach for VQA on 360° video, considering the two auxiliary tasks of viewport proposal and viewport saliency prediction. Our V-CNN approach is composed of two stages, i.e., viewport proposal and VQA. In the first stage, the viewport proposal network (VP-net) is developed to yield several potential viewports, which serves as the first auxiliary task. In the second stage, a viewport quality network (VQ-net) is designed to rate the VQA score of each proposed viewport; the saliency map of the viewport is predicted and then used in rating the VQA score. Consequently, the second auxiliary task of viewport saliency prediction is also achieved. More importantly, the main task of VQA on 360° video is accomplished by integrating the VQA scores of all viewports. The experiments validate the effectiveness of our V-CNN approach in significantly advancing the state-of-the-art performance of VQA on 360° video. In addition, our approach achieves comparable performance in the two auxiliary tasks. The code of our V-CNN approach is available at https://github.com/Archer-Tatsu/V-CNN.
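The final step described in the abstract, integrating the VQA scores of all viewports into one score for the video, can be illustrated with a minimal sketch. The abstract does not state how the scores are integrated, so the weighted mean below is an assumption, and the function name `integrate_viewport_scores` and its `weights` parameter are hypothetical names chosen for illustration; they are not taken from the V-CNN code.

```python
from typing import Sequence


def integrate_viewport_scores(
    scores: Sequence[float], weights: Sequence[float]
) -> float:
    """Combine per-viewport quality scores into a single video-level score.

    ASSUMPTION: a weighted mean is used here purely for illustration; the
    abstract only says the viewport scores are "integrated".
    """
    if len(scores) != len(weights) or len(scores) == 0:
        raise ValueError("scores and weights must be non-empty and equal in length")
    total_weight = sum(weights)
    if total_weight <= 0:
        raise ValueError("weights must sum to a positive value")
    # Each viewport contributes in proportion to its (hypothetical) weight.
    return sum(s * w for s, w in zip(scores, weights)) / total_weight
```

For example, two viewports scored 60.0 and 80.0 with hypothetical weights 1.0 and 3.0 would give (60 + 240) / 4 = 75.0 under this assumption.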
Pages: 10169-10178
Number of pages: 10
Related papers
50 in total
  • [21] Viewport-adaptive 360-degree video coding
    Hu, Qiang
    Zhou, Jun
    Zhang, Xiaoyun
    Shi, Zhiru
    Gao, Zhiyong
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (17-18) : 12205 - 12226
  • [22] PARIMA: Viewport Adaptive 360-Degree Video Streaming
    Chopra, Lovish
    Chakraborty, Sarthak
    Mondal, Abhijit
    Chakraborty, Sandip
    PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2021 (WWW 2021), 2021, : 2379 - 2391
  • [23] Gaze-Assisted Viewport Control for 360° Video on Smartphone
    Shen, Linfeng
    Chen, Yuchi
    Liu, Jiangchuan
    Journal of Computer Science and Technology, 2022, 37 : 906 - 918
  • [24] VAS360: QoE-Driven Viewport Adaptive Streaming for 360 Video
    Hu, Yuxiang
    Liu, Yu
    Wang, Yumei
    2019 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2019, : 324 - 329
  • [25] Implementing Viewport Tile Extractor for Viewport-Adaptive 360-Degree Video Tiled Streaming
    Jeong, Jong-Beom
    Lee, Soonbin
    Kim, Inae
    Ryu, Eun-Seok
    35TH INTERNATIONAL CONFERENCE ON INFORMATION NETWORKING (ICOIN 2021), 2021, : 8 - 12
  • [26] Advancing User Quality of Experience Using Viewport Archives in Viewport-Aware Tile-Based 360-Degree Video Streaming
    Dziubinski, Kiana
    Bandai, Masaki
    2021 IEEE INTERNATIONAL WORKSHOP TECHNICAL COMMITTEE ON COMMUNICATIONS QUALITY AND RELIABILITY (CQR 2021), 2021,
  • [27] SVP: Sinusoidal Viewport Prediction for 360-Degree Video Streaming
    Jiang, Xiaolan
    Naas, Si Ahmed
    Chiang, Yi-Han
    Sigg, Stephan
    Ji, Yusheng
    IEEE ACCESS, 2020, 8 : 164471 - 164481
  • [28] Delivering 360-degree video with Viewport-adaptive Truncation
    Qiu, Tian
    Jain, Ish Kumar
    Wu, Raini
    Bharadia, Dinesh
    Cosman, Pamela
    2022 25TH INTERNATIONAL SYMPOSIUM ON WIRELESS PERSONAL MULTIMEDIA COMMUNICATIONS (WPMC), 2022,
  • [29] Efficient viewport prediction and tiling schemes for 360 degree video streaming
    Adhuran, Jayasingam
    Martini, Maria G.
    PROCEEDINGS OF THE 2024 15TH ACM MULTIMEDIA SYSTEMS CONFERENCE 2024, MMSYS 2024, 2024, : 374 - 380
  • [30] Optimized viewport-adaptive 360-degree video streaming
    Chen, Xiaolei
    Wu, Di
    Ahmad, Ishfaq
    CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2021, 6 (03) : 347 - 359