Blind image quality prediction by exploiting multi-level deep representations

被引:96
|
作者
Gao, Fei [1 ,2 ]
Yu, Jun [1 ]
Zhu, Suguo [1 ]
Huang, Qingming [3 ]
Han, Qi [4 ]
机构
[1] Hangzhou Dianzi Univ, Sch Comp Sci & Technol, Key Lab Complex Syst Modeling & Simulat, Hangzhou 310018, Zhejiang, Peoples R China
[2] Xidian Univ, State Key Lab Integrated Serv Network, Xian 710071, Shaanxi, Peoples R China
[3] Univ Chinese Acad Sci, Sch Comp & Control Engn, Beijing 100190, Peoples R China
[4] Univ Texas San Antonio, Dept Comp Sci, San Antonio, TX USA
基金
中国国家自然科学基金;
关键词
Image quality assessment; Deep learning; Convolutional Neural Networks (CNN); Multi-level deep representation; Support vector regression; NATURAL SCENE STATISTICS; FRAMEWORK;
D O I
10.1016/j.patcog.2018.04.016
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Blind image quality assessment (BIQA) aims at precisely estimating human perceived image quality with no access to a reference. Recently, several attempts have been made to develop BIQA methods based on deep neural networks (DNNs). Although these methods obtained promising performance, they have some limitations: (1) their DNN models are actually "shallow" in term of depth; and (2) these methods typically use the output of the last layer in the DNN model as the feature representation for quality prediction. Since the representation depth has been demonstrated beneficial for various vision tasks, it is significant to explore very deep networks for learning BIQA models. Besides, the information in the last layer may unduly generalize over local artifacts which are highly related to quality degradation. On the contrary, intermediate layers may be sensitive to local degradations but will not capture high-level semantics. Thus, reasoning at multiple levels of representation is necessary in the IQA task. In this paper, we propose to extract multi-level representations from a very deep DNN model for learning an effective BIQA model, and consequently present a simple but extraordinarily effective BIQA framework, codenamed BLINDER (BLind Image quality predictioN via multi-level DEep Representations). Thorough experiments have been conducted on five standard databases, which show that a significant improvement can be achieved by adopting multi-level deep representations. Besides, BLINDER considerably outperforms previous state-of-the-art BIQA methods for authentically distorted images. (C) 2018 Elsevier Ltd. All rights reserved.
引用
收藏
页码:432 / 442
页数:11
相关论文
共 50 条
  • [1] Learning Multi-level Deep Representations for Image Emotion Classification
    Tianrong Rao
    Xiaoxu Li
    Min Xu
    [J]. Neural Processing Letters, 2020, 51 : 2043 - 2061
  • [2] Learning Multi-level Deep Representations for Image Emotion Classification
    Rao, Tianrong
    Li, Xiaoxu
    Xu, Min
    [J]. NEURAL PROCESSING LETTERS, 2020, 51 (03) : 2043 - 2061
  • [3] DEEP BLIND SYNTHESIZED IMAGE QUALITY ASSESSMENT WITH CONTEXTUAL MULTI-LEVEL FEATURE POOLING
    Wang, Xiaochuan
    Wang, Kai
    Yang, Bailin
    Li, Frederick W. B.
    Liang, Xiaohui
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, : 435 - 439
  • [4] BLIND STEREOSCOPIC IMAGE QUALITY ASSESSMENT BY DEEP NEURAL NETWORK OF MULTI-LEVEL FEATURE FUSION
    Yan, Jiebin
    Fang, Yuming
    Huang, Liping
    Min, Xiongkuo
    Yao, Yiru
    Zhai, Guangtao
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,
  • [5] No-Reference Quality Assessment of Pan-Sharpening Images with Multi-Level Deep Image Representations
    Stepien, Igor
    Oszust, Mariusz
    [J]. REMOTE SENSING, 2022, 14 (05)
  • [6] Learning multi-level and multi-scale deep representations for privacy image classification
    Han, Yahui
    Huang, Yonggang
    Pan, Lei
    Zheng, Yunbo
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (02) : 2259 - 2274
  • [7] Learning multi-level and multi-scale deep representations for privacy image classification
    Yahui Han
    Yonggang Huang
    Lei Pan
    Yunbo Zheng
    [J]. Multimedia Tools and Applications, 2022, 81 : 2259 - 2274
  • [8] Learning Multi-level Representations for Image Emotion Recognition in the Deep Convolutional Network
    Zhang, Hao
    Liu, Yanan
    Xu, Dan
    He, Kangjian
    Peng, Guoqin
    Yue, Yingying
    Liu, Ruhan
    [J]. THIRTEENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING (ICGIP 2021), 2022, 12083
  • [9] Image Blind Deblurring Algorithm Based on Deep Multi-level Wavelet Transform
    Chen Shuzhen
    Cao Shipeng
    Cui Meiyue
    Lian Qiusheng
    [J]. JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2021, 43 (01) : 154 - 161
  • [10] Image Blind Deblurring Algorithm Based on Deep Multi-level Wavelet Transform
    Chen, Shuzhen
    Cao, Shipeng
    Cui, Meiyue
    Lian, Qiusheng
    [J]. Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology, 2021, 43 (01): : 154 - 161