Information-theoretic model comparison unifies saliency metrics

Cited by: 104
Authors
Kuemmerer, Matthias [1]
Wallis, Thomas S. A. [1,2]
Bethge, Matthias [1,3,4]
Affiliations
[1] Univ Tubingen, Werner Reichardt Ctr Integrat Neurosci, D-72076 Tubingen, Germany
[2] Univ Tubingen, Dept Comp Sci, D-72076 Tubingen, Germany
[3] Bernstein Ctr Computat Neurosci, D-72076 Tubingen, Germany
[4] Max Planck Inst Biol Cybernet, D-72076 Tubingen, Germany
Keywords
visual attention; eye movements; probabilistic modeling; likelihood; point processes; EYE-MOVEMENTS; FIXATION SELECTION; ATTENTION; GUIDANCE; FEATURES; SEARCH; SCENES;
DOI
10.1073/pnas.1510393112
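Chinese Library Classification
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences]
Discipline classification codes
07; 0710; 09
Abstract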
Learning the properties of an image associated with human gaze placement is important both for understanding how biological systems explore the environment and for computer vision applications. There is a large literature on quantitative eye movement models that seeks to predict fixations from images (sometimes termed "saliency" prediction). A major problem known to the field is that existing model comparison metrics give inconsistent results, causing confusion. We argue that the primary reason for these inconsistencies is that different metrics and models use different definitions of what a "saliency map" entails. For example, some metrics expect a model to account for image-independent central fixation bias whereas others will penalize a model that does. Here we bring saliency evaluation into the domain of information by framing fixation prediction models probabilistically and calculating information gain. We jointly optimize the scale, the center bias, and spatial blurring of all models within this framework. Evaluating existing metrics on these rephrased models produces almost perfect agreement in model rankings across the metrics. Model performance is separated from center bias and spatial blurring, avoiding the confounding of these factors in model comparison. We additionally provide a method to show where and how models fail to capture information in the fixations on the pixel level. These methods are readily extended to spatiotemporal models of fixation scan-paths, and we provide a software package to facilitate their use.
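As a rough illustration of the "information gain" idea mentioned in the abstract, the sketch below treats a model's prediction as a probability density over pixels and reports the average log-likelihood advantage over a baseline density, in bits per fixation. This is a minimal sketch under stated assumptions, not the authors' software package: the function name `information_gain`, the toy 4x4 densities, and the (row, column) fixation format are all hypothetical, and the joint optimization of scale, center bias, and blur described in the abstract is not reproduced.

```python
import numpy as np


def information_gain(model_density, baseline_density, fixations):
    """Mean log2-likelihood difference (bits per fixation) of model vs. baseline.

    Both densities are 2-D arrays that sum to 1 (assumed, not checked here);
    `fixations` is a sequence of integer (row, col) pixel coordinates.
    """
    rows, cols = np.asarray(fixations).T
    ll_model = np.log2(model_density[rows, cols])
    ll_base = np.log2(baseline_density[rows, cols])
    return float(np.mean(ll_model - ll_base))


# Toy data (hypothetical): a uniform baseline and a model that puts
# half of its probability mass on the top-left pixel.
baseline = np.full((4, 4), 1.0 / 16.0)
model = np.full((4, 4), 0.5 / 15.0)
model[0, 0] = 0.5

# Both fixations land on the pixel the model favors:
# log2(0.5 / (1/16)) = log2(8) = 3 bits per fixation.
ig_hit = information_gain(model, baseline, [(0, 0), (0, 0)])

# Comparing the baseline against itself gives zero gain.
ig_self = information_gain(baseline, baseline, [(0, 0), (1, 1)])
```

A positive value means the model assigns more probability to the observed fixations than the baseline does; a negative value means it does worse than the baseline on those fixations.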
Pages: 16054-16059
Page count: 6