Exploring The Optimal Visual Vocabulary Sizes for Semantic Concept Detection

被引:0
|
作者
Guo, Jinlin [1 ]
Qiu, Zhengwei [1 ]
Gurrin, Cathal [1 ]
机构
[1] Dublin City Univ, CLARITY, Dublin 9, Ireland
关键词
FEATURES;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The framework based on the Bag-of-Visual-Words (BoVW) feature representation and SVM classification is popularly used for generic content-based concept detection or visual categorization. However, visual vocabulary (VV) size, one important factor in this framework, is always chosen differently and arbitrarily in previous work. In this paper, we focus on investigating the optimal VV sizes depending on other components of this framework which also govern the performance. This is useful as a default VV size for reducing the computation cost. By unsupervised clustering, a series of VVs covering a wide range of sizes are evaluated under two popular local features, three assignment modes, and four kernels on two different-scale benchmarking datasets respectively. These factors are also evaluated. Experimental results show that best VV sizes vary as these factors change. However, the concept detection performance usually improves as the VV size increases initially, and then gains less, or even deteriorates if larger VVs are used since overfitting occurs. Overall, VVs with sizes ranging from 1024 to 4096 achieve best performance with higher probability when compared with other-size VVs. With regard to the other factors, experimental results show that the OpponentSIFT descriptor outperforms the SURF feature, and soft assignment mode yields better performance than binary and hard assignment. In addition, generalized RBF kernels such as chi(2) and Laplace RBF kernels are more appropriate for semantic concept detection with SVM classification.
引用
收藏
页码:109 / 114
页数:6
相关论文
共 50 条
  • [11] Using semantic features to improve large-scale visual concept detection
    Sjoberg, Mats
    Laaksonen, Jorma
    2014 12TH INTERNATIONAL WORKSHOP ON CONTENT-BASED MULTIMEDIA INDEXING (CBMI), 2014,
  • [12] CONCEPT DIFFERENTIATION BY SEMANTIC AND VISUAL MEDIATION
    HOFMAN, JE
    MIKHAELOVICZ, R
    PSYCHOLOGICAL REPORTS, 1975, 36 (02) : 575 - 578
  • [13] Toward a Visual Concept Vocabulary for GAN Latent Space
    Schwettmann, Sarah
    Hernandez, Evan
    Bau, David
    Klein, Samuel
    Andreas, Jacob
    Torralba, Antonio
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 6784 - 6792
  • [14] Searching visual semantic spaces with concept filters
    Zavesky, Eric
    Liu, Zhu
    Gibbon, David
    Shahraray, Behzad
    ICSC 2007: INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING, PROCEEDINGS, 2007, : 329 - +
  • [15] Exploring semantic groups through visual approaches
    Bodenreider, O
    McCray, AT
    JOURNAL OF BIOMEDICAL INFORMATICS, 2003, 36 (06) : 414 - 432
  • [16] Concept-Specific Visual Vocabulary Construction for Object Categorization
    Zhang, Chunjie
    Liu, Jing
    Ouyang, Yi
    Lu, Hanqing
    Ma, Songde
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2009, 2009, 5879 : 936 - 942
  • [17] Semantic-enriched visual vocabulary construction in a weakly supervised context
    Rizoiu, Marian-Andrei
    Velcin, Julien
    Lallich, Stephane
    INTELLIGENT DATA ANALYSIS, 2015, 19 (01) : 161 - 185
  • [18] In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation
    Kang, Dahyun
    Cho, Minsu
    COMPUTER VISION - ECCV 2024, PT XLI, 2025, 15099 : 143 - 164
  • [19] Multi-Scale Image Semantic Recognition with Hierarchical Visual Vocabulary
    Jiang, Xinghao
    Sun, Tanfeng
    Fu, GuangLei
    COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2011, 8 (03) : 931 - 951
  • [20] Exploring Long Tail Visual Relationship Recognition with Large Vocabulary
    Abdelkarim, Sherif
    Agarwal, Aniket
    Achlioptas, Panos
    Chen, Jun
    Huang, Jiaji
    Li, Boyang
    Church, Kenneth
    Elhoseiny, Mohamed
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 15901 - 15910