Exploring The Optimal Visual Vocabulary Sizes for Semantic Concept Detection

Cited by: 0
Authors
Guo, Jinlin [1]
Qiu, Zhengwei [1]
Gurrin, Cathal [1]
Affiliations
[1] Dublin City Univ, CLARITY, Dublin 9, Ireland
Keywords
FEATURES;
DOI
Not available
Chinese Library Classification number
TP301 [Theory, Methods];
Discipline classification code
081202;
Abstract
The framework that combines a Bag-of-Visual-Words (BoVW) feature representation with SVM classification is widely used for generic content-based concept detection and visual categorization. However, the visual vocabulary (VV) size, an important factor in this framework, has been chosen differently and often arbitrarily in previous work. In this paper, we investigate the optimal VV sizes as a function of the other components of the framework that also govern performance. The result is useful as a default VV size that reduces computation cost. Using unsupervised clustering, we build a series of VVs covering a wide range of sizes and evaluate them with two popular local features, three assignment modes, and four kernels on two benchmark datasets of different scales. These factors themselves are also evaluated. Experimental results show that the best VV size varies as these factors change. However, concept detection performance usually improves as the VV size increases initially, then gains less, or even deteriorates for larger VVs because overfitting occurs. Overall, VVs with sizes from 1024 to 4096 are more likely to achieve the best performance than VVs of other sizes. Regarding the other factors, the OpponentSIFT descriptor outperforms the SURF feature, and soft assignment yields better performance than binary and hard assignment. In addition, generalized RBF kernels such as the χ² and Laplace RBF kernels are more appropriate for semantic concept detection with SVM classification.
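As a rough illustration of the factors named in the abstract, the sketch below encodes one image's local descriptors against a given visual vocabulary under binary, hard, and soft assignment, and computes two generalized RBF kernels on the resulting histograms. It is a minimal sketch under stated assumptions, not the authors' implementation: the function names (`encode`, `chi2_rbf`, `laplace_rbf`), the parameters `sigma` and `gamma`, and the Gaussian weighting used for soft assignment are all hypothetical choices, since the abstract does not give the exact formulas. The vocabulary is assumed to come from a prior unsupervised clustering step (for example k-means), which is not shown.

```python
import math


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def encode(descriptors, vocabulary, mode="hard", sigma=1.0):
    """Encode local descriptors as a BoVW histogram over `vocabulary`.

    mode="binary": 1.0 if a visual word is the nearest word of any descriptor.
    mode="hard":   count of descriptors per nearest word, L1-normalised.
    mode="soft":   Gaussian-weighted votes to all words, L1-normalised
                   (an assumed weighting scheme, one of several in use).
    """
    hist = [0.0] * len(vocabulary)
    for d in descriptors:
        dists = [sq_dist(d, w) for w in vocabulary]
        if mode == "soft":
            # Shift by the minimum distance so the largest weight is exp(0) = 1,
            # which avoids a zero total when all distances are large.
            dmin = min(dists)
            weights = [math.exp(-(x - dmin) / (2 * sigma ** 2)) for x in dists]
            total = sum(weights)
            for i, w in enumerate(weights):
                hist[i] += w / total
        else:
            nearest = min(range(len(dists)), key=dists.__getitem__)
            if mode == "binary":
                hist[nearest] = 1.0
            else:
                hist[nearest] += 1.0
    if mode == "binary":
        return hist
    total = sum(hist)
    return [h / total for h in hist] if total else hist


def chi2_rbf(x, y, gamma=1.0):
    """Generalized RBF kernel with chi-square distance: exp(-gamma * chi2)."""
    # Bins where both histograms are zero contribute nothing and are skipped.
    d = sum((a - b) ** 2 / (a + b) for a, b in zip(x, y) if a + b > 0)
    return math.exp(-gamma * d)


def laplace_rbf(x, y, gamma=1.0):
    """Generalized RBF kernel with L1 distance: exp(-gamma * sum|x_i - y_i|)."""
    return math.exp(-gamma * sum(abs(a - b) for a, b in zip(x, y)))


# Toy example: a 2-word vocabulary and three 2-D descriptors (made-up values).
vocab = [[0.0, 0.0], [10.0, 10.0]]
descs = [[0.0, 1.0], [1.0, 0.0], [9.0, 10.0]]
for m in ("binary", "hard", "soft"):
    print(m, encode(descs, vocab, mode=m))
```

In practice a kernel matrix built from such a function could be passed to an SVM that accepts precomputed kernels; the vocabulary size studied in the paper corresponds to `len(vocabulary)` here.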
Pages: 109-114
Number of pages: 6