Exploring The Optimal Visual Vocabulary Sizes for Semantic Concept Detection

Cited by: 0
Authors
Guo, Jinlin [1]
Qiu, Zhengwei [1]
Gurrin, Cathal [1]
Affiliation
[1] Dublin City Univ, CLARITY, Dublin 9, Ireland
Keywords
FEATURES;
DOI
Not available
Chinese Library Classification
TP301 [Theory, Methods];
Discipline classification code
081202;
Abstract
The framework that combines the Bag-of-Visual-Words (BoVW) feature representation with SVM classification is widely used for generic content-based concept detection and visual categorization. However, the visual vocabulary (VV) size, an important factor in this framework, has been chosen differently and often arbitrarily in previous work. In this paper, we investigate how the optimal VV size depends on the other components of the framework that also govern performance. The results are useful for choosing a default VV size that reduces computation cost. Using unsupervised clustering, we build a series of VVs covering a wide range of sizes and evaluate them with two popular local features, three assignment modes, and four kernels on two benchmark datasets of different scales. The effects of these factors themselves are also evaluated. Experimental results show that the best VV size varies as these factors change. In general, concept detection performance improves as the VV size increases at first, then the gains diminish, and performance can even deteriorate with larger VVs because overfitting occurs. Overall, VVs with sizes from 1024 to 4096 are more likely to achieve the best performance than VVs of other sizes. As for the other factors, the OpponentSIFT descriptor outperforms the SURF feature, and soft assignment yields better performance than binary and hard assignment. In addition, generalized RBF kernels such as the χ² and Laplace RBF kernels are more appropriate for semantic concept detection with SVM classification.
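The three assignment modes and the two generalized RBF kernels named in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names, the Gaussian weighting used for soft assignment, the `sigma` and `gamma` parameters, and the exact form of the χ² distance (no 1/2 factor) are all assumptions made for illustration, since the abstract does not specify them.

```python
import numpy as np

def bovw_histogram(descriptors, vocabulary, mode="hard", sigma=1.0):
    """Encode local descriptors (n, d) against a visual vocabulary (k, d).

    Hypothetical helper; `mode` is "hard", "binary", or "soft".
    """
    # Squared Euclidean distance from every descriptor to every visual word.
    d2 = ((descriptors[:, None, :] - vocabulary[None, :, :]) ** 2).sum(axis=2)
    k = vocabulary.shape[0]
    if mode in ("hard", "binary"):
        # Each descriptor votes for its single nearest visual word.
        counts = np.bincount(d2.argmin(axis=1), minlength=k).astype(float)
        if mode == "binary":
            # Record only whether a word occurs at all.
            return (counts > 0).astype(float)
        hist = counts
    elif mode == "soft":
        # Assumed Gaussian weighting: each descriptor spreads a unit vote
        # over all words. Subtracting the row minimum keeps exp() stable
        # and does not change the normalized weights.
        w = np.exp(-(d2 - d2.min(axis=1, keepdims=True)) / (2.0 * sigma ** 2))
        w /= w.sum(axis=1, keepdims=True)
        hist = w.sum(axis=0)
    else:
        raise ValueError(f"unknown assignment mode: {mode}")
    return hist / hist.sum()  # L1-normalize the histogram

def chi2_rbf_kernel(X, Y, gamma=1.0, eps=1e-12):
    """exp(-gamma * chi2(x, y)) for non-negative histograms in rows of X, Y."""
    diff = X[:, None, :] - Y[None, :, :]
    total = X[:, None, :] + Y[None, :, :]
    chi2 = (diff ** 2 / (total + eps)).sum(axis=2)
    return np.exp(-gamma * chi2)

def laplace_rbf_kernel(X, Y, gamma=1.0):
    """exp(-gamma * ||x - y||_1) for rows of X and Y."""
    l1 = np.abs(X[:, None, :] - Y[None, :, :]).sum(axis=2)
    return np.exp(-gamma * l1)
```

A kernel matrix computed this way could, for example, be passed to an SVM that accepts precomputed kernels; the vocabulary itself would come from an unsupervised clustering step such as k-means, which is omitted here.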
Pages: 109-114
Page count: 6