Blind image quality prediction by exploiting multi-level deep representations

被引:96
|
作者
Gao, Fei [1 ,2 ]
Yu, Jun [1 ]
Zhu, Suguo [1 ]
Huang, Qingming [3 ]
Han, Qi [4 ]
机构
[1] Hangzhou Dianzi Univ, Sch Comp Sci & Technol, Key Lab Complex Syst Modeling & Simulat, Hangzhou 310018, Zhejiang, Peoples R China
[2] Xidian Univ, State Key Lab Integrated Serv Network, Xian 710071, Shaanxi, Peoples R China
[3] Univ Chinese Acad Sci, Sch Comp & Control Engn, Beijing 100190, Peoples R China
[4] Univ Texas San Antonio, Dept Comp Sci, San Antonio, TX USA
基金
中国国家自然科学基金;
关键词
Image quality assessment; Deep learning; Convolutional Neural Networks (CNN); Multi-level deep representation; Support vector regression; NATURAL SCENE STATISTICS; FRAMEWORK;
D O I
10.1016/j.patcog.2018.04.016
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Blind image quality assessment (BIQA) aims at precisely estimating human perceived image quality with no access to a reference. Recently, several attempts have been made to develop BIQA methods based on deep neural networks (DNNs). Although these methods obtained promising performance, they have some limitations: (1) their DNN models are actually "shallow" in term of depth; and (2) these methods typically use the output of the last layer in the DNN model as the feature representation for quality prediction. Since the representation depth has been demonstrated beneficial for various vision tasks, it is significant to explore very deep networks for learning BIQA models. Besides, the information in the last layer may unduly generalize over local artifacts which are highly related to quality degradation. On the contrary, intermediate layers may be sensitive to local degradations but will not capture high-level semantics. Thus, reasoning at multiple levels of representation is necessary in the IQA task. In this paper, we propose to extract multi-level representations from a very deep DNN model for learning an effective BIQA model, and consequently present a simple but extraordinarily effective BIQA framework, codenamed BLINDER (BLind Image quality predictioN via multi-level DEep Representations). Thorough experiments have been conducted on five standard databases, which show that a significant improvement can be achieved by adopting multi-level deep representations. Besides, BLINDER considerably outperforms previous state-of-the-art BIQA methods for authentically distorted images. (C) 2018 Elsevier Ltd. All rights reserved.
引用
收藏
页码:432 / 442
页数:11
相关论文
共 50 条
  • [31] Clean and robust multi-level subspace representations learning for deep multi-view subspace clustering
    Xu, Kaiqiang
    Tang, Kewei
    Su, Zhixun
    Tan, Hongchen
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 252
  • [32] A multi-level descriptor using ultra-deep feature for image retrieval
    Wu, Zebin
    Yu, Junqing
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (18) : 25655 - 25672
  • [33] A multi-level descriptor using ultra-deep feature for image retrieval
    Zebin Wu
    Junqing Yu
    [J]. Multimedia Tools and Applications, 2019, 78 : 25655 - 25672
  • [34] A deep-learning framework for multi-level peptide–protein interaction prediction
    Yipin Lei
    Shuya Li
    Ziyi Liu
    Fangping Wan
    Tingzhong Tian
    Shao Li
    Dan Zhao
    Jianyang Zeng
    [J]. Nature Communications, 12
  • [35] Joint Deep Model with Multi-Level Attention and Hybrid-Prediction for Recommendation
    Lin, Zhipeng
    Tang, Yuhua
    Zhang, Yongjun
    [J]. ENTROPY, 2019, 21 (02):
  • [36] Product Evaluation Prediction Model Based on Multi-Level Deep Feature Fusion
    Zhou, Qingyan
    Li, Hao
    Zhang, Youhua
    Zheng, Junhong
    [J]. FUTURE INTERNET, 2023, 15 (01):
  • [37] Multi-Level Deep Cascade Trees for Conversion Rate Prediction in Recommendation System
    Wen, Hong
    Zhang, Jing
    Lin, Quan
    Yang, Keping
    Huang, Pipei
    [J]. THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 338 - 345
  • [38] CONCISE REPRESENTATIONS FOR ASSOCIATION RULES IN MULTI-LEVEL DATASETS
    Xu, Yue
    Shaw, Gavin
    Li, Yuefeng
    [J]. JOURNAL OF SYSTEMS SCIENCE AND SYSTEMS ENGINEERING, 2009, 18 (01) : 53 - 70
  • [39] An evaluation of image quality for high quality digital halftoning - Image analysis and evaluation of multi-level error diffusion
    Miyata, K
    Saito, M
    Tsumura, N
    Miyake, Y
    [J]. HUMAN VISION AND ELECTRONIC IMAGING II, 1997, 3016 : 176 - 183
  • [40] CONCISE REPRESENTATIONS FOR ASSOCIATION RULES IN MULTI-LEVEL DATASETS
    Gavin SHAW
    [J]. Journal of Systems Science and Systems Engineering, 2009, 18 (01) : 53 - 70