Exploring The Optimal Visual Vocabulary Sizes for Semantic Concept Detection

被引:0
|
作者
Guo, Jinlin [1 ]
Qiu, Zhengwei [1 ]
Gurrin, Cathal [1 ]
机构
[1] Dublin City Univ, CLARITY, Dublin 9, Ireland
关键词
FEATURES;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The framework based on the Bag-of-Visual-Words (BoVW) feature representation and SVM classification is popularly used for generic content-based concept detection or visual categorization. However, visual vocabulary (VV) size, one important factor in this framework, is always chosen differently and arbitrarily in previous work. In this paper, we focus on investigating the optimal VV sizes depending on other components of this framework which also govern the performance. This is useful as a default VV size for reducing the computation cost. By unsupervised clustering, a series of VVs covering a wide range of sizes are evaluated under two popular local features, three assignment modes, and four kernels on two different-scale benchmarking datasets respectively. These factors are also evaluated. Experimental results show that best VV sizes vary as these factors change. However, the concept detection performance usually improves as the VV size increases initially, and then gains less, or even deteriorates if larger VVs are used since overfitting occurs. Overall, VVs with sizes ranging from 1024 to 4096 achieve best performance with higher probability when compared with other-size VVs. With regard to the other factors, experimental results show that the OpponentSIFT descriptor outperforms the SURF feature, and soft assignment mode yields better performance than binary and hard assignment. In addition, generalized RBF kernels such as chi(2) and Laplace RBF kernels are more appropriate for semantic concept detection with SVM classification.
引用
收藏
页码:109 / 114
页数:6
相关论文
共 50 条
  • [21] Exploiting concept association to boost multimedia semantic concept detection
    Gao, Sheng
    Zhu, Xinglei
    Sun, Qibin
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PTS 1-3, PROCEEDINGS, 2007, : 981 - 984
  • [22] Semantic information processing of physical simulation based on Scientific Concept Vocabulary model
    Kino C.
    Suzuki Y.
    Takemiya H.
    IEEJ Transactions on Electronics, Information and Systems, 2010, 130 (07) : 1228 - 1237+19
  • [23] Exploring visual culture: Definitions, concept, contexts
    Duncum, Paul
    JOURNAL OF AESTHETIC EDUCATION, 2008, 42 (01): : 121 - 123
  • [24] Exploring the concept of optimal functionality in old age
    Algilani, Samal
    Ostlund-Lagerstrom, Lina
    Kihlgren, Annica
    Blomberg, Karin
    Brummer, Robert J.
    Schoultz, Ida
    JOURNAL OF MULTIDISCIPLINARY HEALTHCARE, 2014, 7 : 69 - 79
  • [25] Classifier optimization for multimedia semantic concept detection
    Gao, Sheng
    Sun, Qibin
    2006 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO - ICME 2006, VOLS 1-5, PROCEEDINGS, 2006, : 1489 - +
  • [26] A Novel Semantic Model for Video Concept Detection
    Zhu, Songhao
    Liu, Yuncai
    2009 16TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-6, 2009, : 1837 - +
  • [27] Region Trajectories for Video Semantic Concept Detection
    Ye, Yuancheng
    Rong, Xuejian
    Yang, Xiaodong
    Tian, Yingli
    ICMR'16: PROCEEDINGS OF THE 2016 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2016, : 255 - 259
  • [28] SEMANTIC EFFECTS IN VISUAL WORD DETECTION WITH VISUAL SIMILARITY CONTROLLED
    HENDERSON, L
    CHARD, J
    PERCEPTION & PSYCHOPHYSICS, 1978, 23 (04): : 290 - 298
  • [29] ATTENTIVE SEMANTIC EXPLORING FOR MANIPULATED FACE DETECTION
    Chen, Zehao
    Yang, Hua
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 1985 - 1989
  • [30] Exploring the Potentiality of Semantic Features for Paraphrase Detection
    Anchieta, Rafael Torres
    Salgueiro Pardo, Thiago Alexandre
    COMPUTATIONAL PROCESSING OF THE PORTUGUESE LANGUAGE, PROPOR 2020, 2020, 12037 : 228 - 238