Stereoscopic Video Quality Prediction Based on End-to-End Dual Stream Deep Neural Networks

Cited by: 13
Authors
Zhou, Wei [1]
Chen, Zhibo [1]
Li, Weiping [1]
Affiliations
[1] Univ Sci & Technol China, Dept Elect Engn & Informat Sci, CAS Key Lab Technol Geospatial Informat Proc & Ap, Hefei 230027, Anhui, Peoples R China
Keywords
Convolutional neural network; Stereoscopic video; No-reference video quality assessment; Spatiotemporal pooling
DOI
10.1007/978-3-030-00764-5_44
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
In this paper, we propose a no-reference stereoscopic video quality assessment (NR-SVQA) method based on an end-to-end dual stream deep neural network (DNN), which incorporates left and right view sub-networks. The network takes image patch pairs from pivotal frames of the left and right views as inputs and evaluates the perceptual quality of each image patch pair. By combining multiple convolution, max-pooling and fully-connected layers with regression in the framework, distortion-related features are learned end-to-end in a purely data-driven manner. A spatiotemporal pooling strategy is then applied to the image patch pairs to estimate the quality of the entire stereoscopic video. The proposed network architecture, which we name End-to-end Dual stream deep Neural network (EDN), is trained and tested on a well-known stereoscopic video dataset that is split by reference video. Experimental results demonstrate that our proposed method outperforms state-of-the-art algorithms.
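The spatiotemporal pooling step can be illustrated with a minimal sketch. The abstract does not say which pooling operator is used, so plain averaging is an assumption here: patch-pair scores are first averaged within each pivotal frame (spatial pooling), and the frame scores are then averaged over the video (temporal pooling). The function name `pool_video_quality` and the input layout are hypothetical and not taken from the paper.

```python
from statistics import mean


def pool_video_quality(patch_scores):
    """Pool per-patch-pair quality scores into one video-level score.

    patch_scores: list of pivotal frames, each a non-empty list of
    predicted scores for that frame's left/right patch pairs.
    Mean pooling is an assumption; the paper's operator may differ.
    """
    if not patch_scores or any(len(frame) == 0 for frame in patch_scores):
        raise ValueError("every frame needs at least one patch-pair score")
    # Spatial pooling: one score per pivotal frame.
    frame_scores = [mean(frame) for frame in patch_scores]
    # Temporal pooling: one score for the whole video.
    return mean(frame_scores)


# Two pivotal frames with two and three patch pairs respectively.
example_scores = [[3.0, 4.0], [2.0, 4.0, 6.0]]
video_score = pool_video_quality(example_scores)
print(video_score)
```

Averaging per frame before averaging over time keeps frames with many patches from dominating the video-level score; a weighted or percentile-based pooling could be substituted in the same two-stage structure.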
Pages: 482-492
Number of pages: 11