Exploring The Optimal Visual Vocabulary Sizes for Semantic Concept Detection

Citations: 0

Authors:

Guo, Jinlin [1]

Qiu, Zhengwei [1]

Gurrin, Cathal [1]

Affiliations:

[1] Dublin City Univ, CLARITY, Dublin 9, Ireland

Source:

2013 11TH INTERNATIONAL WORKSHOP ON CONTENT-BASED MULTIMEDIA INDEXING (CBMI 2013) | 2013

Keywords:

FEATURES;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP301 [Theory, Methods];

Discipline classification code:

081202 ;

Abstract:

The framework combining the Bag-of-Visual-Words (BoVW) feature representation with SVM classification is widely used for generic content-based concept detection and visual categorization. However, the visual vocabulary (VV) size, an important factor in this framework, has typically been chosen arbitrarily and inconsistently in previous work. In this paper, we investigate how the optimal VV size depends on the other components of the framework that also govern performance. The result is useful as a default VV size for reducing computation cost. Using unsupervised clustering, we build a series of VVs covering a wide range of sizes and evaluate them under two popular local features, three assignment modes, and four kernels on two benchmark datasets of different scales. These factors are themselves also evaluated. Experimental results show that the best VV size varies as these factors change. In general, however, concept detection performance improves as the VV size first increases, then gains less, and can even deteriorate for larger VVs because overfitting occurs. Overall, VVs with sizes from 1024 to 4096 are more likely to achieve the best performance than VVs of other sizes. As for the other factors, the OpponentSIFT descriptor outperforms the SURF feature, and soft assignment yields better performance than binary and hard assignment. In addition, generalized RBF kernels such as the χ² and Laplacian RBF kernels are more appropriate for semantic concept detection with SVM classification.
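The pipeline the abstract describes (cluster local descriptors into a vocabulary, assign descriptors to visual words, compare the resulting histograms with a generalized RBF kernel) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation. The function names, the plain k-means clustering, the Gaussian weighting used for soft assignment, and the `gamma` and `sigma` parameters are all assumptions made for illustration. The kernels use the commonly cited forms exp(-γ Σ (x_i - y_i)² / (x_i + y_i)) and exp(-γ Σ |x_i - y_i|), which may differ in detail from those in the paper.

```python
import numpy as np

def sq_dists(a, b):
    """Squared Euclidean distances between rows of a and rows of b."""
    return ((a[:, None, :] - b[None, :, :]) ** 2).sum(axis=2)

def build_vocabulary(descriptors, k, iters=10, seed=0):
    """Plain Lloyd's k-means (an assumed choice); returns k visual words."""
    rng = np.random.default_rng(seed)
    idx = rng.choice(len(descriptors), size=k, replace=False)
    centres = descriptors[idx].copy()
    for _ in range(iters):
        labels = sq_dists(descriptors, centres).argmin(axis=1)
        for j in range(k):
            members = descriptors[labels == j]
            if len(members):  # keep the old centre if a cluster empties
                centres[j] = members.mean(axis=0)
    return centres

def encode(descriptors, vocab, mode="hard", sigma=1.0):
    """BoVW histogram of one image under hard, binary, or soft assignment."""
    d2 = sq_dists(descriptors, vocab)
    if mode == "soft":
        # Gaussian-weighted votes; each descriptor's weights sum to 1.
        # Subtracting the row minimum only improves numerical stability.
        w = np.exp(-(d2 - d2.min(axis=1, keepdims=True)) / (2 * sigma**2))
        hist = (w / w.sum(axis=1, keepdims=True)).sum(axis=0)
    else:
        # Each descriptor votes for its single nearest visual word.
        hist = np.bincount(d2.argmin(axis=1), minlength=len(vocab)).astype(float)
        if mode == "binary":
            return (hist > 0).astype(float)  # presence/absence only
    return hist / hist.sum()  # L1-normalise the counts

def chi2_rbf_kernel(X, Y, gamma=1.0, eps=1e-12):
    """exp(-gamma * chi-square distance) between histogram rows."""
    diff = X[:, None, :] - Y[None, :, :]
    summ = X[:, None, :] + Y[None, :, :]
    return np.exp(-gamma * (diff**2 / (summ + eps)).sum(axis=2))

def laplacian_rbf_kernel(X, Y, gamma=1.0):
    """exp(-gamma * L1 distance) between histogram rows."""
    d = np.abs(X[:, None, :] - Y[None, :, :]).sum(axis=2)
    return np.exp(-gamma * d)

if __name__ == "__main__":
    rng = np.random.default_rng(1)
    # Synthetic stand-ins for local descriptors of two images.
    img_a, img_b = rng.random((60, 16)), rng.random((40, 16))
    vocab = build_vocabulary(np.vstack([img_a, img_b]), k=8)
    H = np.stack([encode(img_a, vocab, "soft"), encode(img_b, vocab, "soft")])
    print(chi2_rbf_kernel(H, H))
```

A Gram matrix computed this way could be passed to an SVM that accepts precomputed kernels, for example scikit-learn's `SVC(kernel="precomputed")`. Sweeping `k` over a range of sizes then mirrors the kind of comparison the abstract reports.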


Pages: 109-114

Page count: 6

50 records in total

[31] VISUAL DETECTION BY THE ROD SYSTEM IN GOLDFISH OF DIFFERENT SIZES
POWERS, MK
BASSI, CJ
RONE, LA
RAYMOND, PA
VISION RESEARCH, 1988, 28 (02) : 211 - 221
[32] SHiNe: Semantic Hierarchy Nexus for Open-vocabulary Object Detection
Liu, Mingxuan
Hayes, Tyler L.
Ricci, Elisa
Csurka, Gabriela
Volpi, Riccardo
2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 16634 - 16644
[33] Semantic concept and weighted visual feature based image retrieval
Zhu, Nana
Zhang, Huaxiang
Kong, Wenjie
Zhang, Huaxiang, 1600, Binary Information Press (11): : 6461 - 6469
[34] Visual Emotion Analysis via Affective Semantic Concept Discovery
Zhu, Yunwen
Zhu, Yonghua
Ge, Ning
Gao, Wenjing
Zhang, Wenjun
SCIENTIFIC PROGRAMMING, 2022, 2022
[35] Improving SINR with smart RIS solutions: exploring optimal dimensions and sizes
Qeryaqos, Bushra J.
Ayoob, Saad A.
INTERNATIONAL JOURNAL OF MICROWAVE AND WIRELESS TECHNOLOGIES, 2025,
[36] A Hybrid Supervised-Unsupervised Vocabulary Generation Algorithm for Visual Concept Recognition
Binder, Alexander
Wojcikiewicz, Wojciech
Mueller, Christina
Kawanabe, Motoaki
COMPUTER VISION - ACCV 2010, PT III, 2011, 6494 : 95 - 108
[37] OVIS: Open-Vocabulary Visual Instance Search via Visual-Semantic Aligned Representation Learning
Liu, Sheng
Lin, Kevin
Wang, Lijuan
Yuan, Junsong
Liu, Zicheng
THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 1773 - 1781
[38] Fusing audio vocabulary with visual features for pornographic video detection
Liu, Yizhi
Yang, Ying
Xie, Hongtao
Tang, Sheng
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2014, 31 : 69 - 76
[39] Tree Fusion Method for Semantic Concept Detection in Images
Mansouri, Jafar
Khademi, Morteza
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2014, E97D (08): : 2209 - 2211
[40] Improving Automatic Video Retrieval with Semantic Concept Detection
Koskela, Markus
Sjoberg, Mats
Laaksonen, Jorma
IMAGE ANALYSIS, PROCEEDINGS, 2009, 5575 : 480 - 489
