Contribution of Low, Mid and High-Level Image Features of Indoor Scenes in Predicting Human Similarity Judgements

被引：2

作者：

Mikhailova, Anastasiia ^{[1
]}

Santos-Victor, Jose ^{[1
]}

Coco, Moreno, I ^{[2
]}

机构：

[1] Univ Lisbon, Inst Super Tecn, Lisbon, Portugal

[2] Sapienza Univ Rome, Rome, Italy

来源：

PATTERN RECOGNITION AND IMAGE ANALYSIS (IBPRIA 2022) | 2022年 / 13256卷

关键词：

Image similarity; Scene semantics; Spatial envelope; SVM; Hierarchical regression;

D O I：

10.1007/978-3-031-04881-4_40

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Human judgments can still be considered the gold standard in the assessment of image similarity, but they are too expensive and time-consuming to acquire. Even though most existing computational models make almost exclusive use of low-level information to evaluate the similarity between images, human similarity judgements are known to rely on both high-level semantic and low-level visual image information. The current study aims to evaluate the impact of different types of image features on predicting human similarity judgements. We investigated how low-level (colour differences), mid-level (spatial envelope) and high-level (distributional semantics) information predict within-category human judgements of 400 indoor scenes across 4 categories in a Four-Alternative Forced Choice task in which participants had to select the most distinctive scene among four scenes presented on the screen. Linear regression analysis showed that low-level (t = 4.14, p < 0.001), mid-level (t = 3.22, p< 0.01) and high-level (t = 2.07, p < 0.04) scene information significantly predicted the probability of a scene to be selected. Additionally, the SVM model that incorporates low-mid-high level properties had 56% accuracy in predicting human similarity judgments. Our results point out: 1) the importance of including mid and high-level image properties into computational models of similarity to better characterise the cognitive mechanisms underlying human judgements, and 2) the necessity of further research in understanding how human similarity judgements are done as there is a sizeable variability in our data that it is not accounted for by the metrics we investigated.

引用

页码：505 / 514

页数：10

共 50 条

[1] Indoor Image Representation by High-Level Semantic Features
Sitaula, Chiranjibi
Xiang, Yong
Zhang, Yushu
Lu, Xuequan
Aryal, Sunil
[J]. IEEE ACCESS, 2019, 7 : 84967 - 84979
[2] Image Quality Assessment by Integration of Low-level & High-Level Features: Threshold Similarity Index
Chaudhary, Jatin
Pant, Dibakar Raj
Pokharel, Suresh
Skon, Jukka-Pekka
Heikkonen, Jukka
Kanth, Rajeev
[J]. 2022 IEEE 31ST INTERNATIONAL SYMPOSIUM ON INDUSTRIAL ELECTRONICS (ISIE), 2022, : 135 - 141
[3] High-level attributes modeling for indoor scenes classification
Wang, Chaojie
Yu, Jun
Tao, Dapeng
[J]. NEUROCOMPUTING, 2013, 121 : 337 - 343
[4] Texture Features for High-level Classification of Acoustic Scenes
Waldekar, Shefali
Saha, Goutam
[J]. PROCEEDINGS OF 2019 IEEE REGION 10 SYMPOSIUM (TENSYMP), 2019, : 710 - 715
[5] Image caption generation with high-level image features
Ding, Songtao
Qu, Shiru
Xi, Yuling
Sangaiah, Arun Kumar
Wan, Shaohua
[J]. PATTERN RECOGNITION LETTERS, 2019, 123 : 89 - 95
[6] Mining association rules between low-level image features and high-level concepts
Sethi, IK
Coman, IL
Stan, D
[J]. DATA MINING AND KNOWLEDGE DISCOVERY: THEORY, TOOLS AND TECHNOLOGY III, 2001, 4384 : 279 - 290
[7] Sternum image retrieval based on high-level semantic information and low-level features
Chen, Qin
Tai, Xiaoying
[J]. BMEI 2008: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING AND INFORMATICS, VOL 1, 2008, : 362 - 366
[8] Temporal Video Segmentation to Scenes Using High-Level Audiovisual Features
Sidiropoulos, Panagiotis
Mezaris, Vasileios
Kompatsiaris, Ioannis
Meinedo, Hugo
Bugalho, Miguel
Trancoso, Isabel
[J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2011, 21 (08) : 1163 - 1177
[9] Unifying Low-Level and High-Level Music Similarity Measures
Bogdanov, Dmitry
Serra, Joan
Wack, Nicolas
Herrera, Perfecto
Serra, Xavier
[J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2011, 13 (04) : 687 - 701
[10] Saliency from High-Level Semantic Image Features
Azaza A.
van de Weijer J.
Douik A.
Zolfaghari J.
Masana M.
[J]. SN Computer Science, 2020, 1 (4)

← 1 2 3 4 5 →