A Deeper Look at Human Visual Perception of Images

Authors
Fan S. [1]
Koenig B.L. [2]
Zhao Q. [3]
Kankanhalli M.S. [1]
Affiliations
[1] School of Computing, National University of Singapore, 13 Computing Drive, Singapore
[2] Psychology Department, Southern Utah University, 351 W University Blvd, Cedar City, UT 84720
[3] Department of Computer Science and Engineering, University of Minnesota, 200 Union St SE, Minneapolis, MN 55455
Funding
National Research Foundation, Singapore
Keywords
Computational modeling; Empirical modeling; Visual sentiment
DOI
10.1007/s42979-019-0061-5
Abstract
How would one describe an image? Interesting? Pleasant? Aesthetic? A number of studies have classified images with respect to these attributes. A common approach is to link lower-level image features with higher-level properties and to train a computational model to perform classification using human-annotated ground truth. Although these studies yield algorithms with reasonable prediction performance, they provide few insights into why and how the algorithms work. The current study focuses on how multiple visual factors affect human perception of digital images. We extend an existing dataset with quantitative measures of human perception of 31 image attributes under six viewing conditions: images that are intact, inverted, grayscale, inverted and grayscale, and images showing mainly low- or high-spatial-frequency information. Statistical analyses indicate that holistic cues, color information, semantics, and saliency vary in importance across different types of attributes. Building on these insights, we construct an empirical model of human image perception. Motivated by the empirical model, we design computational models that predict high-level image attributes. Extensive experiments demonstrate that understanding human visual perception helps create better computational models. © 2020, Springer Nature Singapore Pte Ltd.
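The six viewing conditions named in the abstract can be illustrated with a minimal NumPy sketch. Everything below is an assumption made for illustration and is not taken from the paper: "inverted" is read as an upside-down (vertical) flip, grayscale uses BT.601 luma weights, and the low- and high-spatial-frequency versions come from a Gaussian filter applied in the Fourier domain with a hypothetical width `sigma`. The function names are likewise hypothetical.

```python
import numpy as np

def to_grayscale(img):
    """Luma-weighted grayscale (BT.601 weights, an assumption), kept as 3 channels."""
    gray = img @ np.array([0.299, 0.587, 0.114])
    return np.repeat(gray[..., None], 3, axis=2)

def invert(img):
    """Upside-down image; assumes 'inverted' means a vertical flip."""
    return img[::-1]

def low_pass(img, sigma=4.0):
    """Gaussian low-pass filter applied in the Fourier domain (sigma in pixels)."""
    h, w = img.shape[:2]
    fy = np.fft.fftfreq(h)[:, None]   # vertical frequencies, cycles per pixel
    fx = np.fft.fftfreq(w)[None, :]   # horizontal frequencies, cycles per pixel
    # Fourier transform of a unit-area spatial Gaussian with std sigma
    gain = np.exp(-2.0 * (np.pi * sigma) ** 2 * (fx ** 2 + fy ** 2))
    spectrum = np.fft.fft2(img, axes=(0, 1))
    return np.real(np.fft.ifft2(spectrum * gain[..., None], axes=(0, 1)))

def high_pass(img, sigma=4.0):
    """Residual left after removing the low-pass content (zero-mean detail)."""
    return img - low_pass(img, sigma)

def viewing_conditions(img, sigma=4.0):
    """Return the six stimulus variants for a float RGB image in [0, 1]."""
    return {
        "intact": img,
        "inverted": invert(img),
        "grayscale": to_grayscale(img),
        "inverted_grayscale": invert(to_grayscale(img)),
        "low_sf": np.clip(low_pass(img, sigma), 0.0, 1.0),
        # shift the zero-mean residual to mid-gray so it can be displayed
        "high_sf": np.clip(high_pass(img, sigma) + 0.5, 0.0, 1.0),
    }

# Usage with a synthetic image standing in for a dataset image
rng = np.random.default_rng(0)
image = rng.random((32, 48, 3))
variants = viewing_conditions(image)
```

Splitting the image into a low-pass part and its residual keeps the two frequency conditions complementary: summing them recovers the original image, so each variant withholds exactly the information the other shows.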