Viewport Proposal CNN for 360° Video Quality Assessment

被引:49
|
作者
Li, Chen [1 ]
Xu, Mai [1 ,2 ]
Jiang, Lai [1 ]
Zhang, Shanyi [1 ]
Tao, Xiaoming [3 ]
机构
[1] Beihang Univ, Sch Elect & Informat Engn, Beijing, Peoples R China
[2] Beihang Univ, Hangzhou Innovat Inst HZII, Hangzhou, Zhejiang, Peoples R China
[3] Tsinghua Univ, Dept Elect Engn, Beijing, Peoples R China
关键词
PREDICTION; SALIENCY;
D O I
10.1109/CVPR.2019.01042
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent years have witnessed the growing interest in visual quality assessment (VQA) for 360 degrees video. Unfortunately, the existing VQA approaches do not consider the facts that: 1) Observers only see viewports of 360 degrees video, rather than patches or whole 360 degrees frames. 2) Within the viewport, only salient regions can be perceived by observers with high resolution. Thus, this paper proposes a viewport-based convolutional neural network (V-CNN) approach for VQA on 360 degrees video, considering both auxiliary tasks of viewport proposal and viewport saliency prediction. Our V-CNN approach is composed of two stages, i.e., viewport proposal and VQA. In the first stage, the viewport proposal network (VP-net) is developed to yield several potential viewports, seen as the first auxiliary task. In the second stage, a viewport quality network (VQ-net) is designed to rate the VQA score for each proposed viewport, in which the saliency map of the viewport is predicted and then utilized in VQA score rating. Consequently, another auxiliary task of viewport saliency prediction can be achieved. More importantly, the main task of VQA on 360 degrees video can be accomplished via integrating the VQA scores of all view ports. The experiments validate the effectiveness of our V-CNN approach in significantly advancing the state-of-the-art performance of VQA on 360 degrees video. In addition, our approach achieves comparable performance in two auxiliary tasks. The code of our V-CNN approach is available at https://github.com/Archer-Tatsu/V-CNN.
引用
收藏
页码:10169 / 10178
页数:10
相关论文
共 50 条
  • [41] Fixed Viewport Applications for Omnidirectional Video Content Combining Traditional and 360 Video For Immersive Experiences
    Potetsianakis, Emmanouil
    Thomas, Emmanuel
    El Assal, Karim
    van Deventer, Oskar
    MMSYS'20: PROCEEDINGS OF THE 2020 MULTIMEDIA SYSTEMS CONFERENCE, 2020, : 369 - 372
  • [42] Local and Global Viewport History Sampling for Improved User Quality of Experience in Viewport-Aware Tile-Based 360-Degree Video Streaming
    Dziubinski, Kiana
    Bandai, Masaki
    IEEE ACCESS, 2024, 12 : 137455 - 137471
  • [43] VASTile: Viewport Adaptive Scalable 360-Degree Video Frame Tiling
    Madarasingha, Chamara
    Thilakarathna, Kanchana
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 4555 - 4563
  • [44] Online Bitrate Selection for Viewport Adaptive 360-Degree Video Streaming
    Tang, Ming
    Wong, Vincent W. S.
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2022, 21 (07) : 2506 - 2517
  • [45] Two-stream network with viewport selection for blind omnidirectional video quality assessment
    Chen, Junhao
    Niu, Yuzhen
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (04) : 12139 - 12157
  • [46] Adaptive Tiling Selection for Viewport Adaptive Streaming of 360-degree Video
    Nguyen, Duc V.
    Tran, Huyen T. T.
    Thang, Truong Cong
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2019, E102D (01) : 48 - 51
  • [47] Two-stream network with viewport selection for blind omnidirectional video quality assessment
    Junhao Chen
    Yuzhen Niu
    Multimedia Tools and Applications, 2024, 83 : 12139 - 12157
  • [48] CNN-MR for No Reference Video Quality Assessment
    Wang, Chunfeng
    Su, Li
    Huang, Qingming
    2017 4TH INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND CONTROL ENGINEERING (ICISCE), 2017, : 224 - 228
  • [49] A Weighted Tile-based Approach for Viewport Adaptive 360° Video Streaming
    Yaqoob, Abid
    Muntean, Gabriel-Miro
    2020 IEEE INTERNATIONAL SYMPOSIUM ON BROADBAND MULTIMEDIA SYSTEMS AND BROADCASTING (BMSB), 2020,
  • [50] Viewport Prediction Method of 360 VR Video using Sound Localization Information
    Jeong, Eunyoung
    You, Dongho
    Hyun, Changjong
    Seo, Bong-Seok
    Kim, Namtae
    Kim, Dong Ho
    Lee, Ye Hoon
    2018 TENTH INTERNATIONAL CONFERENCE ON UBIQUITOUS AND FUTURE NETWORKS (ICUFN 2018), 2018, : 673 - 675