End-to-End Blind Quality Assessment of Compressed Videos Using Deep Neural Networks

被引:78
|
作者
Liu, Wentao [1 ]
Duanmu, Zhengfang [1 ]
Wang, Zhou [1 ]
机构
[1] Univ Waterloo, Waterloo, ON, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Blind video quality assessment; convolutional neural network; multi-task learning;
D O I
10.1145/3240508.3240643
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Blind video quality assessment (BVQA) algorithms are traditionally designed with a two-stage approach - a feature extraction stage that computes typically hand-crafted spatial and/or temporal features, and a regression stage working in the feature space that predicts the perceptual quality of the video. Unlike the traditional BVQA methods, we propose a Video Multi-task End-to-end Optimized neural Network (V-MEON) that merges the two stages into one, where the feature extractor and the regressor are jointly optimized. Our model uses a multi-task DNN framework that not only estimates the perceptual quality of the test video but also provides a probabilistic prediction of its codec type. This framework allows us to train the network with two complementary sets of labels, both of which can be obtained at low cost. The training process is composed of two steps. In the first step, early convolutional layers are pre-trained to extract spatiotemporal quality-related features with the codec classification subtask. In the second step, initialized with the pre-trained feature extractor, the whole network is jointly optimized with the two subtasks together. An additional critical step is the adoption of 3D convolutional layers, which creates novel spatiotemporal features that lead to a significant performance boost. Experimental results show that the proposed model clearly outperforms state-of-the-art BVQA methods.The source code of V-MEON is available at https://ece.uwaterloo.ca/zduanmu/acmmm2018bvqa.
引用
收藏
页码:546 / 554
页数:9
相关论文
共 50 条
  • [31] Automating detection and localization of myocardial infarction using shallow and end-to-end deep neural networks
    Jafarian, Kamal
    Vahdat, Vahab
    Salehi, Seyedmohammad
    Mobin, Mohammadsadegh
    APPLIED SOFT COMPUTING, 2020, 93
  • [32] End-to-end multimodal clinical depression recognition using deep neural networks: A comparative analysis
    Muzammel, Muhammad
    Salam, Hanan
    Othmani, Alice
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2021, 211
  • [33] End-to-end image quality assessment
    Raventos, Joaquin
    VISUAL INFORMATION PROCESSING XXI, 2012, 8399
  • [34] DeepSigns: An End-to-End Watermarking Framework for Ownership Protection of Deep Neural Networks
    Rouhani, Bita Darvish
    Chen, Huili
    Koushanfar, Farinaz
    TWENTY-FOURTH INTERNATIONAL CONFERENCE ON ARCHITECTURAL SUPPORT FOR PROGRAMMING LANGUAGES AND OPERATING SYSTEMS (ASPLOS XXIV), 2019, : 485 - 497
  • [35] Towards End-to-End Speech Recognition with Deep Multipath Convolutional Neural Networks
    Zhang, Wei
    Zhai, Minghao
    Huang, Zilong
    Liu, Chen
    Li, Wei
    Cao, Yi
    INTELLIGENT ROBOTICS AND APPLICATIONS, ICIRA 2019, PART VI, 2019, 11745 : 332 - 341
  • [36] Leukocyte Segmentation via End-to-End Learning of Deep Convolutional Neural Networks
    Lu, Yan
    Fan, Haoyi
    Li, Zuoyong
    INTELLIGENCE SCIENCE AND BIG DATA ENGINEERING: VISUAL DATA ENGINEERING, PT I, 2019, 11935 : 191 - 200
  • [37] End-to-end 3D face reconstruction with deep neural networks
    Dou, Pengfei
    Shah, Shishir K.
    Kakadiaris, Ioannis A.
    30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 1503 - 1512
  • [38] A Theoretical Framework for End-to-End Learning of Deep Neural Networks With Applications to Robotics
    Li, Sitan
    Nguyen, Huu-Thiet
    Cheah, Chien Chern
    IEEE ACCESS, 2023, 11 : 21992 - 22006
  • [39] End-to-end Relation Extraction using Neural Networks and Markov Logic Networks
    Pawar, Sachin
    Bhattacharyya, Pushpak
    Palshikar, Girish K.
    15TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2017), VOL 1: LONG PAPERS, 2017, : 818 - 827
  • [40] An End-to-End System for Unconstrained Face Verification with Deep Convolutional Neural Networks
    Chen, Jun-Cheng
    Ranjan, Rajeev
    Kumar, Amit
    Chen, Ching-Hui
    Patel, Vishal M.
    Chellappa, Rama
    2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOP (ICCVW), 2015, : 360 - 368