Exploring The Optimal Visual Vocabulary Sizes for Semantic Concept Detection

被引:0
|
作者
Guo, Jinlin [1 ]
Qiu, Zhengwei [1 ]
Gurrin, Cathal [1 ]
机构
[1] Dublin City Univ, CLARITY, Dublin 9, Ireland
关键词
FEATURES;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The framework based on the Bag-of-Visual-Words (BoVW) feature representation and SVM classification is popularly used for generic content-based concept detection or visual categorization. However, visual vocabulary (VV) size, one important factor in this framework, is always chosen differently and arbitrarily in previous work. In this paper, we focus on investigating the optimal VV sizes depending on other components of this framework which also govern the performance. This is useful as a default VV size for reducing the computation cost. By unsupervised clustering, a series of VVs covering a wide range of sizes are evaluated under two popular local features, three assignment modes, and four kernels on two different-scale benchmarking datasets respectively. These factors are also evaluated. Experimental results show that best VV sizes vary as these factors change. However, the concept detection performance usually improves as the VV size increases initially, and then gains less, or even deteriorates if larger VVs are used since overfitting occurs. Overall, VVs with sizes ranging from 1024 to 4096 achieve best performance with higher probability when compared with other-size VVs. With regard to the other factors, experimental results show that the OpponentSIFT descriptor outperforms the SURF feature, and soft assignment mode yields better performance than binary and hard assignment. In addition, generalized RBF kernels such as chi(2) and Laplace RBF kernels are more appropriate for semantic concept detection with SVM classification.
引用
收藏
页码:109 / 114
页数:6
相关论文
共 50 条
  • [31] VISUAL DETECTION BY THE ROD SYSTEM IN GOLDFISH OF DIFFERENT SIZES
    POWERS, MK
    BASSI, CJ
    RONE, LA
    RAYMOND, PA
    VISION RESEARCH, 1988, 28 (02) : 211 - 221
  • [32] SHiNe: Semantic Hierarchy Nexus for Open-vocabulary Object Detection
    Liu, Mingxuan
    Hayes, Tyler L.
    Ricci, Elisa
    Csurka, Gabriela
    Volpi, Riccardo
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 16634 - 16644
  • [33] Semantic concept and weighted visual feature based image retrieval
    Zhu, Nana
    Zhang, Huaxiang
    Kong, Wenjie
    Zhang, Huaxiang, 1600, Binary Information Press (11): : 6461 - 6469
  • [34] Visual Emotion Analysis via Affective Semantic Concept Discovery
    Zhu, Yunwen
    Zhu, Yonghua
    Ge, Ning
    Gao, Wenjing
    Zhang, Wenjun
    SCIENTIFIC PROGRAMMING, 2022, 2022
  • [35] Improving SINR with smart RIS solutions: exploring optimal dimensions and sizes
    Qeryaqos, Bushra J.
    Ayoob, Saad A.
    INTERNATIONAL JOURNAL OF MICROWAVE AND WIRELESS TECHNOLOGIES, 2025,
  • [36] A Hybrid Supervised-Unsupervised Vocabulary Generation Algorithm for Visual Concept Recognition
    Binder, Alexander
    Wojcikiewicz, Wojciech
    Mueller, Christina
    Kawanabe, Motoaki
    COMPUTER VISION - ACCV 2010, PT III, 2011, 6494 : 95 - 108
  • [37] OVIS: Open-Vocabulary Visual Instance Search via Visual-Semantic Aligned Representation Learning
    Liu, Sheng
    Lin, Kevin
    Wang, Lijuan
    Yuan, Junsong
    Liu, Zicheng
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 1773 - 1781
  • [38] Fusing audio vocabulary with visual features for pornographic video detection
    Liu, Yizhi
    Yang, Ying
    Xie, Hongtao
    Tang, Sheng
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2014, 31 : 69 - 76
  • [39] Tree Fusion Method for Semantic Concept Detection in Images
    Mansouri, Jafar
    Khademi, Morteza
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2014, E97D (08): : 2209 - 2211
  • [40] Improving Automatic Video Retrieval with Semantic Concept Detection
    Koskela, Markus
    Sjoberg, Mats
    Laaksonen, Jorma
    IMAGE ANALYSIS, PROCEEDINGS, 2009, 5575 : 480 - 489