Dissecting the effectiveness of deep features as metric of perceptual image quality

被引：0

作者：

Hernandez-Camara, Pablo ^{[1
]}

Vila-Tomas, Jorge ^{[1
]}

Laparra, Valero ^{[1
]}

Malo, Jesus ^{[1
]}

机构：

[1] Univ Valencia, Image Proc Lab, Paterna 46980, Spain

来源：

NEURAL NETWORKS | 2025年 / 185卷

关键词：

Image quality; Neural networks; Visual neuroscience; Functional principle; Learning environment; Architecture; MODELS; INFORMATION; VISION;

D O I：

10.1016/j.neunet.2025.107189

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

There is an open debate on the role of artificial networks to understand the visual brain. Internal representations of images in artificial networks develop human-like properties. In particular, evaluating distortions using differences between internal features is correlated to human perception of distortion. However, the origins of this correlation are not well understood. Here, we dissect the different factors involved in the emergence of human-like behavior: function, architecture, and environment. To do so, we evaluate the aforementioned human-network correlation at different depths of 46 pre-trained model configurations that include no psycho-visual information. The results show that most of the models correlate better with human opinion than SSIM (a de-facto standard in subjective image quality). Moreover, some models are better than state-of-the-art networks specifically tuned for the application (LPIPS, DISTS). Regarding the function, supervised classification leads to nets that correlate better with humans than the explored models for self- and non-supervised tasks. However, we found that better performance in the task does not imply more human behavior. Regarding the architecture, simpler models correlate better with humans than very deep nets and generally, the highest correlation is not achieved in the last layer. Finally, regarding the environment, training with large natural datasets leads to bigger correlations than training in smaller databases with restricted content, as expected. We also found that the best classification models are not the best for predicting human distances. In the general debate about understanding human vision, our empirical findings imply that explanations have not to be focused on a single abstraction level, but all function, architecture, and environment are relevant.

引用

页数：14

共 50 条

[11] DEEP PERCEPTUAL IMAGE QUALITY ASSESSMENT FOR COMPRESSION
Mier, Juan Carlos
Huang, Eddie
Talebi, Hossein
Yang, Feng
Milanfar, Peyman
2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 1484 - 1488
[12] Deep ensembling for perceptual image quality assessment
Nisar Ahmed
H. M. Shahzad Asif
Abdul Rauf Bhatti
Atif Khan
Soft Computing, 2022, 26 : 7601 - 7622
[13] A perceptual quality metric for image fusion based on regional information
Chen, H
Varshney, PK
Multisensor, Multisource Information Fusion: Architectures, Algorithms and Applications 2005, 2005, 5813 : 34 - 45
[14] Using Deep Perceptual Embeddings as a Quality Metric for Synthetic Imagery
Luer, William
Harguess, Josh
GEOSPATIAL INFORMATICS XI, 2021, 11733
[15] Spread Spectrum Image Watermarking Based on Perceptual Quality Metric
Zhang, Fan
Liu, Wenyu
Lin, Weisi
Ngan, King Ngi
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2011, 20 (11) : 3207 - 3218
[16] On the development of a reduced-reference perceptual image quality metric
Kusuma, TM
Zepernick, HJ
Caldera, M
2005 SYSTEMS COMMUNICATIONS, PROCEEDINGS: ICW 2005, WIRELESS TECHNOLOGIES; ICHSN 2005, HIGH SPEED NETWORKS; ICMCS 2005, MULTIMEDIA COMMUNICATIONS SYSTEMS; SENET 2005, SENSOR NETWORKS, 2005, : 178 - 184
[17] Blind image quality assessment based on statistics features and perceptual features
Zhao, Youen
Ji, Xiuhua
Liu, Zhaoguang
JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2020, 38 (03) : 3515 - 3526
[18] A Perceptual Image Quality Assessment Metric Using Singular Value Decomposition
Wang, Shuigen
Cui, Dongshun
Wang, Baoxian
Zhao, Baojun
Yang, Jinglin
CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2015, 34 (01) : 209 - 229
[19] Pseudo No Reference image quality metric using perceptual data hiding
Ninassi, Alexandre
Le Callet, Patrick
Autrusseau, Florent
HUMAN VISION AND ELECTRONIC IMAGING XI, 2006, 6057
[20] A perceptual metric for stereoscopic image quality assessment based on the binocular energy
Rafik Bensalma
Mohamed-Chaker Larabi
Multidimensional Systems and Signal Processing, 2013, 24 : 281 - 316

← 1 2 3 4 5 →