Dissecting the effectiveness of deep features as metric of perceptual image quality

被引:0
|
作者
Hernandez-Camara, Pablo [1 ]
Vila-Tomas, Jorge [1 ]
Laparra, Valero [1 ]
Malo, Jesus [1 ]
机构
[1] Univ Valencia, Image Proc Lab, Paterna 46980, Spain
关键词
Image quality; Neural networks; Visual neuroscience; Functional principle; Learning environment; Architecture; MODELS; INFORMATION; VISION;
D O I
10.1016/j.neunet.2025.107189
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
There is an open debate on the role of artificial networks to understand the visual brain. Internal representations of images in artificial networks develop human-like properties. In particular, evaluating distortions using differences between internal features is correlated to human perception of distortion. However, the origins of this correlation are not well understood. Here, we dissect the different factors involved in the emergence of human-like behavior: function, architecture, and environment. To do so, we evaluate the aforementioned human-network correlation at different depths of 46 pre-trained model configurations that include no psycho-visual information. The results show that most of the models correlate better with human opinion than SSIM (a de-facto standard in subjective image quality). Moreover, some models are better than state-of-the-art networks specifically tuned for the application (LPIPS, DISTS). Regarding the function, supervised classification leads to nets that correlate better with humans than the explored models for self- and non-supervised tasks. However, we found that better performance in the task does not imply more human behavior. Regarding the architecture, simpler models correlate better with humans than very deep nets and generally, the highest correlation is not achieved in the last layer. Finally, regarding the environment, training with large natural datasets leads to bigger correlations than training in smaller databases with restricted content, as expected. We also found that the best classification models are not the best for predicting human distances. In the general debate about understanding human vision, our empirical findings imply that explanations have not to be focused on a single abstraction level, but all function, architecture, and environment are relevant.
引用
收藏
页数:14
相关论文
共 50 条
  • [11] DEEP PERCEPTUAL IMAGE QUALITY ASSESSMENT FOR COMPRESSION
    Mier, Juan Carlos
    Huang, Eddie
    Talebi, Hossein
    Yang, Feng
    Milanfar, Peyman
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 1484 - 1488
  • [12] Deep ensembling for perceptual image quality assessment
    Nisar Ahmed
    H. M. Shahzad Asif
    Abdul Rauf Bhatti
    Atif Khan
    Soft Computing, 2022, 26 : 7601 - 7622
  • [13] A perceptual quality metric for image fusion based on regional information
    Chen, H
    Varshney, PK
    Multisensor, Multisource Information Fusion: Architectures, Algorithms and Applications 2005, 2005, 5813 : 34 - 45
  • [14] Using Deep Perceptual Embeddings as a Quality Metric for Synthetic Imagery
    Luer, William
    Harguess, Josh
    GEOSPATIAL INFORMATICS XI, 2021, 11733
  • [15] Spread Spectrum Image Watermarking Based on Perceptual Quality Metric
    Zhang, Fan
    Liu, Wenyu
    Lin, Weisi
    Ngan, King Ngi
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2011, 20 (11) : 3207 - 3218
  • [16] On the development of a reduced-reference perceptual image quality metric
    Kusuma, TM
    Zepernick, HJ
    Caldera, M
    2005 SYSTEMS COMMUNICATIONS, PROCEEDINGS: ICW 2005, WIRELESS TECHNOLOGIES; ICHSN 2005, HIGH SPEED NETWORKS; ICMCS 2005, MULTIMEDIA COMMUNICATIONS SYSTEMS; SENET 2005, SENSOR NETWORKS, 2005, : 178 - 184
  • [17] Blind image quality assessment based on statistics features and perceptual features
    Zhao, Youen
    Ji, Xiuhua
    Liu, Zhaoguang
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2020, 38 (03) : 3515 - 3526
  • [18] A Perceptual Image Quality Assessment Metric Using Singular Value Decomposition
    Wang, Shuigen
    Cui, Dongshun
    Wang, Baoxian
    Zhao, Baojun
    Yang, Jinglin
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2015, 34 (01) : 209 - 229
  • [19] Pseudo No Reference image quality metric using perceptual data hiding
    Ninassi, Alexandre
    Le Callet, Patrick
    Autrusseau, Florent
    HUMAN VISION AND ELECTRONIC IMAGING XI, 2006, 6057
  • [20] A perceptual metric for stereoscopic image quality assessment based on the binocular energy
    Rafik Bensalma
    Mohamed-Chaker Larabi
    Multidimensional Systems and Signal Processing, 2013, 24 : 281 - 316