Contribution of Low, Mid and High-Level Image Features of Indoor Scenes in Predicting Human Similarity Judgements

被引:2
|
作者
Mikhailova, Anastasiia [1 ]
Santos-Victor, Jose [1 ]
Coco, Moreno, I [2 ]
机构
[1] Univ Lisbon, Inst Super Tecn, Lisbon, Portugal
[2] Sapienza Univ Rome, Rome, Italy
关键词
Image similarity; Scene semantics; Spatial envelope; SVM; Hierarchical regression;
D O I
10.1007/978-3-031-04881-4_40
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Human judgments can still be considered the gold standard in the assessment of image similarity, but they are too expensive and time-consuming to acquire. Even though most existing computational models make almost exclusive use of low-level information to evaluate the similarity between images, human similarity judgements are known to rely on both high-level semantic and low-level visual image information. The current study aims to evaluate the impact of different types of image features on predicting human similarity judgements. We investigated how low-level (colour differences), mid-level (spatial envelope) and high-level (distributional semantics) information predict within-category human judgements of 400 indoor scenes across 4 categories in a Four-Alternative Forced Choice task in which participants had to select the most distinctive scene among four scenes presented on the screen. Linear regression analysis showed that low-level (t = 4.14, p < 0.001), mid-level (t = 3.22, p< 0.01) and high-level (t = 2.07, p < 0.04) scene information significantly predicted the probability of a scene to be selected. Additionally, the SVM model that incorporates low-mid-high level properties had 56% accuracy in predicting human similarity judgments. Our results point out: 1) the importance of including mid and high-level image properties into computational models of similarity to better characterise the cognitive mechanisms underlying human judgements, and 2) the necessity of further research in understanding how human similarity judgements are done as there is a sizeable variability in our data that it is not accounted for by the metrics we investigated.
引用
收藏
页码:505 / 514
页数:10
相关论文
共 50 条
  • [1] Indoor Image Representation by High-Level Semantic Features
    Sitaula, Chiranjibi
    Xiang, Yong
    Zhang, Yushu
    Lu, Xuequan
    Aryal, Sunil
    [J]. IEEE ACCESS, 2019, 7 : 84967 - 84979
  • [2] Image Quality Assessment by Integration of Low-level & High-Level Features: Threshold Similarity Index
    Chaudhary, Jatin
    Pant, Dibakar Raj
    Pokharel, Suresh
    Skon, Jukka-Pekka
    Heikkonen, Jukka
    Kanth, Rajeev
    [J]. 2022 IEEE 31ST INTERNATIONAL SYMPOSIUM ON INDUSTRIAL ELECTRONICS (ISIE), 2022, : 135 - 141
  • [3] High-level attributes modeling for indoor scenes classification
    Wang, Chaojie
    Yu, Jun
    Tao, Dapeng
    [J]. NEUROCOMPUTING, 2013, 121 : 337 - 343
  • [4] Texture Features for High-level Classification of Acoustic Scenes
    Waldekar, Shefali
    Saha, Goutam
    [J]. PROCEEDINGS OF 2019 IEEE REGION 10 SYMPOSIUM (TENSYMP), 2019, : 710 - 715
  • [5] Image caption generation with high-level image features
    Ding, Songtao
    Qu, Shiru
    Xi, Yuling
    Sangaiah, Arun Kumar
    Wan, Shaohua
    [J]. PATTERN RECOGNITION LETTERS, 2019, 123 : 89 - 95
  • [6] Mining association rules between low-level image features and high-level concepts
    Sethi, IK
    Coman, IL
    Stan, D
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY: THEORY, TOOLS AND TECHNOLOGY III, 2001, 4384 : 279 - 290
  • [7] Sternum image retrieval based on high-level semantic information and low-level features
    Chen, Qin
    Tai, Xiaoying
    [J]. BMEI 2008: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING AND INFORMATICS, VOL 1, 2008, : 362 - 366
  • [8] Temporal Video Segmentation to Scenes Using High-Level Audiovisual Features
    Sidiropoulos, Panagiotis
    Mezaris, Vasileios
    Kompatsiaris, Ioannis
    Meinedo, Hugo
    Bugalho, Miguel
    Trancoso, Isabel
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2011, 21 (08) : 1163 - 1177
  • [9] Unifying Low-Level and High-Level Music Similarity Measures
    Bogdanov, Dmitry
    Serra, Joan
    Wack, Nicolas
    Herrera, Perfecto
    Serra, Xavier
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2011, 13 (04) : 687 - 701
  • [10] Saliency from High-Level Semantic Image Features
    Azaza A.
    van de Weijer J.
    Douik A.
    Zolfaghari J.
    Masana M.
    [J]. SN Computer Science, 2020, 1 (4)