A Deeper Look at Human Visual Perception of Images

Authors
Fan S. [1]
Koenig B.L. [2]
Zhao Q. [3]
Kankanhalli M.S. [1]
Affiliations
[1] School of Computing, National University of Singapore, 13 Computing Drive, Singapore
[2] Psychology Department, Southern Utah University, 351 W University Blvd, Cedar City, 84720, UT
[3] Department of Computer Science and Engineering, University of Minnesota, 200 Union St SE, Minneapolis, 55455, MN
Funding
National Research Foundation, Singapore
Keywords
Computational modeling; Empirical modeling; Visual sentiment
DOI
10.1007/s42979-019-0061-5
Abstract
How would one describe an image? Interesting? Pleasant? Aesthetic? A number of studies have classified images with respect to these attributes. A common approach is to link lower-level image features with higher-level properties, and train a computational model to perform classification using human-annotated ground truth. Although these studies generate algorithms with reasonable prediction performance, they provide few insights into why and how the algorithms work. The current study focuses on how multiple visual factors affect human perception of digital images. We extend an existing dataset with quantitative measures for human perception of 31 image attributes under 6 different viewing conditions: images that are intact, inverted, grayscale, inverted and grayscale, and images showing mainly low- or high-spatial-frequency information. Statistical analyses indicate varying importance of holistic cues, color information, semantics, and saliency on different types of attributes. Building on these insights, we construct an empirical model of human image perception. Motivated by the empirical model, we design computational models that predict high-level image attributes. Extensive experiments demonstrate that understanding human visual perception helps create better computational models. © 2020, Springer Nature Singapore Pte Ltd.
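The six viewing conditions listed in the abstract can be illustrated with a minimal sketch. This is not the authors' stimulus pipeline, which the abstract does not describe. It assumes that "inverted" means turned upside down (a 180-degree rotation) rather than a color negative, that grayscale uses ITU-R BT.601 luma weights, and that a simple box blur stands in for the low-spatial-frequency filter, with the high-frequency version taken as the residual. All function names and the `radius` parameter are hypothetical.

```python
# Illustrative sketch only. An image is a nested list: image[row][col] = (r, g, b), values 0..255.

def invert(image):
    """Rotate by 180 degrees (assumed meaning of 'inverted')."""
    return [row[::-1] for row in image[::-1]]

def grayscale(image):
    """Replace each pixel by its luma (BT.601 weights), repeated over three channels."""
    out = []
    for row in image:
        new_row = []
        for r, g, b in row:
            y = round(0.299 * r + 0.587 * g + 0.114 * b)
            new_row.append((y, y, y))
        out.append(new_row)
    return out

def low_pass(image, radius=1):
    """Box blur: mean over a (2*radius+1)^2 window, truncated at the image borders."""
    h, w = len(image), len(image[0])
    out = []
    for i in range(h):
        new_row = []
        for j in range(w):
            window = [image[a][b]
                      for a in range(max(0, i - radius), min(h, i + radius + 1))
                      for b in range(max(0, j - radius), min(w, j + radius + 1))]
            new_row.append(tuple(round(sum(p[c] for p in window) / len(window))
                                 for c in range(3)))
        out.append(new_row)
    return out

def high_pass(image, radius=1):
    """Original minus blurred, re-centred on mid-gray (128) and clipped to 0..255."""
    blurred = low_pass(image, radius)
    return [[tuple(min(255, max(0, p[c] - q[c] + 128)) for c in range(3))
             for p, q in zip(row, blurred_row)]
            for row, blurred_row in zip(image, blurred)]

def viewing_conditions(image):
    """Return the six variants named in the abstract, keyed by condition."""
    return {
        "intact": image,
        "inverted": invert(image),
        "grayscale": grayscale(image),
        "inverted_grayscale": grayscale(invert(image)),
        "low_spatial_frequency": low_pass(image),
        "high_spatial_frequency": high_pass(image),
    }
```

Calling `viewing_conditions(image)` on any RGB image stored this way yields one variant per condition. A real experiment would more likely use Gaussian or frequency-domain filters with cutoffs expressed in cycles per degree, which this sketch does not attempt.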