Information-theoretic model comparison unifies saliency metrics

被引：104

作者：

Kuemmerer, Matthias ^{[1
]}

Wallis, Thomas S. A. ^{[1
,2
]}

Bethge, Matthias ^{[1
,3
,4
]}

机构：

[1] Univ Tubingen, Werner Reichardt Ctr Integrat Neurosci, D-72076 Tubingen, Germany

[2] Univ Tubingen, Dept Comp Sci, D-72076 Tubingen, Germany

[3] Bernstein Ctr Computat Neurosci, D-72076 Tubingen, Germany

[4] Max Planck Inst Biol Cybernet, D-72076 Tubingen, Germany

来源：

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA | 2015年 / 112卷 / 52期

关键词：

visual attention; eye movements; probabilistic modeling; likelihood; point processes; EYE-MOVEMENTS; FIXATION SELECTION; ATTENTION; GUIDANCE; FEATURES; SEARCH; SCENES;

D O I：

10.1073/pnas.1510393112

中图分类号：

O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

Learning the properties of an image associated with human gaze placement is important both for understanding how biological systems explore the environment and for computer vision applications. There is a large literature on quantitative eye movement models that seeks to predict fixations from images (sometimes termed "saliency" prediction). A major problem known to the field is that existing model comparison metrics give inconsistent results, causing confusion. We argue that the primary reason for these inconsistencies is because different metrics and models use different definitions of what a "saliency map" entails. For example, some metrics expect a model to account for image-independent central fixation bias whereas others will penalize a model that does. Here we bring saliency evaluation into the domain of information by framing fixation prediction models probabilistically and calculating information gain. We jointly optimize the scale, the center bias, and spatial blurring of all models within this framework. Evaluating existing metrics on these rephrased models produces almost perfect agreement in model rankings across the metrics. Model performance is separated from center bias and spatial blurring, avoiding the confounding of these factors in model comparison. We additionally provide a method to show where and how models fail to capture information in the fixations on the pixel level. These methods are readily extended to spatiotemporal models of fixation scan-paths, and we provide a software package to facilitate their use.

引用

页码：16054 / 16059

页数：6

共 50 条

[31] Information-theoretic logic
Corcoran, J
[J]. TRUTH IN PERSPECTIVE: RECENT ISSUES IN LOGIC, REPRESENTATION AND ONTOLOGY, 1998, : 113 - 135
[32] Information-Theoretic Adverbialism
Gert, Joshua
[J]. AUSTRALASIAN JOURNAL OF PHILOSOPHY, 2021, 99 (04) : 696 - 715
[33] The information-theoretic turn
Blevins, James P.
[J]. PSIHOLOGIJA, 2013, 46 (04) : 355 - 375
[34] Information-Theoretic Applications of the Logarithmic Probability Comparison Bound
Atar, Rami
Merhav, Neri
[J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 2015, 61 (10) : 5366 - 5386
[35] Information density converges in dialogue: Towards an information-theoretic model
Xu, Yang
Reitter, David
[J]. COGNITION, 2018, 170 : 147 - 163
[36] API-based and information-theoretic metrics for measuring the quality of software modularization
Sarkar, Santonu
Rama, Girish Maskeri
Kak, Avinash C.
[J]. IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2007, 33 (01) : 14 - 32
[37] Comparison between Bayesian and information-theoretic model averaging: Fossil fuels prices example
Drachal, Krzysztof
[J]. ENERGY ECONOMICS, 2018, 74 : 208 - 251
[38] Class-specific feature selection using fuzzy information-theoretic metrics
Ma, Xi-Ao
Xu, Hao
Liu, Yi
Zhang, Justin Zuopeng
[J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 136
[39] Information-Theoretic Analysis of the Dynamics of an Executable Biological Model
Sadot, Avital
Sarbu, Septimia
Kesseli, Juha
Amir-Kroll, Hila
Zhang, Wei
Nykter, Matti
Shmulevich, Ilya
[J]. PLOS ONE, 2013, 8 (03):
[40] An information-theoretic model for image watermarking and data hiding
Moulin, P
Mihçak, MK
Lin, GI
[J]. 2000 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL III, PROCEEDINGS, 2000, : 667 - 670

← 1 2 3 4 5 →