End-to-End Blind Quality Assessment of Compressed Videos Using Deep Neural Networks

被引:78
|
作者
Liu, Wentao [1 ]
Duanmu, Zhengfang [1 ]
Wang, Zhou [1 ]
机构
[1] Univ Waterloo, Waterloo, ON, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Blind video quality assessment; convolutional neural network; multi-task learning;
D O I
10.1145/3240508.3240643
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Blind video quality assessment (BVQA) algorithms are traditionally designed with a two-stage approach - a feature extraction stage that computes typically hand-crafted spatial and/or temporal features, and a regression stage working in the feature space that predicts the perceptual quality of the video. Unlike the traditional BVQA methods, we propose a Video Multi-task End-to-end Optimized neural Network (V-MEON) that merges the two stages into one, where the feature extractor and the regressor are jointly optimized. Our model uses a multi-task DNN framework that not only estimates the perceptual quality of the test video but also provides a probabilistic prediction of its codec type. This framework allows us to train the network with two complementary sets of labels, both of which can be obtained at low cost. The training process is composed of two steps. In the first step, early convolutional layers are pre-trained to extract spatiotemporal quality-related features with the codec classification subtask. In the second step, initialized with the pre-trained feature extractor, the whole network is jointly optimized with the two subtasks together. An additional critical step is the adoption of 3D convolutional layers, which creates novel spatiotemporal features that lead to a significant performance boost. Experimental results show that the proposed model clearly outperforms state-of-the-art BVQA methods.The source code of V-MEON is available at https://ece.uwaterloo.ca/zduanmu/acmmm2018bvqa.
引用
收藏
页码:546 / 554
页数:9
相关论文
共 50 条
  • [11] Image Shadow Removal Using End-To-End Deep Convolutional Neural Networks
    Fan, Hui
    Han, Meng
    Li, Jinjiang
    APPLIED SCIENCES-BASEL, 2019, 9 (05):
  • [12] A study on tooth segmentation and numbering using end-to-end deep neural networks
    Silva, Bernardo
    Pinheiro, Lais
    Oliveira, Luciano
    Pithon, Matheus
    2020 33RD SIBGRAPI CONFERENCE ON GRAPHICS, PATTERNS AND IMAGES (SIBGRAPI 2020), 2020, : 164 - 171
  • [13] Separation of Nonlinearly Mixed Sources Using End-to-End Deep Neural Networks
    Zamani, Hojatollah
    Razavikia, Saeed
    Otroshi-Shahreza, Hatef
    Amini, Arash
    IEEE SIGNAL PROCESSING LETTERS, 2020, 27 : 101 - 105
  • [14] DeepLanes: End-To-End Lane Position Estimation using Deep Neural Networks
    Gurghian, Alexandru
    Koduri, Tejaswi
    Bailur, Smita V.
    Carey, Kyle J.
    Murali, Vidya N.
    PROCEEDINGS OF 29TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, (CVPRW 2016), 2016, : 38 - 45
  • [15] DeepAttest: An End-to-End Attestation Framework for Deep Neural Networks
    Chen, Huili
    Fu, Cheng
    Rouhani, Bita Darvish
    Zhao, Jishen
    Koushanfar, Farinaz
    PROCEEDINGS OF THE 2019 46TH INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA '19), 2019, : 487 - 498
  • [16] END-TO-END OPTIMIZED SPEECH CODING WITH DEEP NEURAL NETWORKS
    Kankanahalli, Srihari
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 2521 - 2525
  • [17] End-to-End Training of Deep Neural Networks in the Fourier Domain
    Fulop, Andras
    Horvath, Andras
    MATHEMATICS, 2022, 10 (12)
  • [18] Handwriting-Based Gender Classification Using End-to-End Deep Neural Networks
    Illouz, Evyatar
    David, Eli
    Netanyahu, Nathan S.
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2018, PT III, 2018, 11141 : 613 - 621
  • [19] An end-to-end deep learning system for requirements classification using recurrent neural networks
    AlDhafer, Osamah
    Ahmad, Irfan
    Mahmood, Sajjad
    INFORMATION AND SOFTWARE TECHNOLOGY, 2022, 147
  • [20] Stereoscopic Video Quality Prediction Based on End-to-End Dual Stream Deep Neural Networks
    Zhou, Wei
    Chen, Zhibo
    Li, Weiping
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING, PT III, 2018, 11166 : 482 - 492