Early versus Late Dimensionality Reduction of Bag-of-Words Feature Representation for Image Classification

被引:0
|
作者
Tsai, Chih-Fong [1 ]
Hu, Ya-Han [2 ]
Lin, Wei-Chao [3 ]
Wang, Ming-Chang [4 ]
机构
[1] Natl Cent Univ, Dept Informat Management, Taoyuan, Taiwan
[2] Natl Chung Cheng Univ, Dept Informat Management, Chiayi, Taiwan
[3] Chang Gung Univ, Dept Informat Management, Taoyuan, Taiwan
[4] Natl Chung Cheng Univ, Dept Business Adm, Chiayi, Taiwan
关键词
Dimensionality reduction; feature selection; bag-of-words; image classification; principal component analysis;
D O I
10.1145/3175587.3175598
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Extracting the bag-of-words (BoW) feature from images has been widely used for image classification. In general, some local keypoints are first of all detected from each image and the keypoint descriptor, such as scale-invariant feature transform (SIFT), is extracted. Then, the keypoint descriptors of a given image dataset are tokenized (or clustered) to generate a visual-word vocabulary (or codebook). Next, the visual-word vector of an image contains the presence or absence information of each visual word in the image, e.g. the number of keypoints in the corresponding cluster, i.e. visual word. Consequently, images are represented by a histogram over visual words. Since the dimensionalities of the SIFT keypoint descriptor and the final BoW feature for image classification are certainly high, this paper aims at examining the effect of performing dimensionality reduction (DR) for both different features on classification accuracy. In particular, early DR is used over the SIFT descriptor and late DR for the BoW feature. The experimental results based on Caltech 101 (2-D images) and ESB (3-D images) datasets show that reducing 50% dimensionality of the SIFT descriptor by PCA can allow the SVM classifier to perform similar to the one without DR. On the other hand, late DR only works for 2-D images, but the classification performance of SVM cannot be kept if over 25% dimensionality of the BoW feature is reduced..
引用
收藏
页码:42 / 45
页数:4
相关论文
共 50 条
  • [1] Importance of feature locations in bag-of-words image classification
    Lazic, Nevena
    Aarabi, Parham
    [J]. 2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PTS 1-3, PROCEEDINGS, 2007, : 641 - 644
  • [2] Image classification by visual bag-of-words refinement and reduction
    Lu, Zhiwu
    Wang, Liwei
    Wen, Ji-Rong
    [J]. NEUROCOMPUTING, 2016, 173 : 373 - 384
  • [3] Keypoint selection for efficient bag-of-words feature generation and effective image classification
    Lin, Wei-Chao
    Tsai, Chih-Fong
    Chen, Zong-Yao
    Ke, Shih-Wen
    [J]. INFORMATION SCIENCES, 2016, 329 : 33 - 51
  • [4] Bag-of-words representation for biomedical time series classification
    Wang, Jin
    Liu, Ping
    She, Mary F. H.
    Nahavandi, Saeid
    Kouzani, Abbas
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2013, 8 (06) : 634 - 644
  • [5] Bag-of-words feature representation for blind image quality assessment with local quantized pattern
    Xie, Xuemei
    Zhang, Yazhong
    Wu, Jinjian
    Shi, Guangming
    Dong, Weisheng
    [J]. NEUROCOMPUTING, 2017, 266 : 176 - 187
  • [6] The influence of preprocessing on text classification using a bag-of-words representation
    HaCohen-Kerner, Yaakov
    Miller, Daniel
    Yigal, Yair
    [J]. PLOS ONE, 2020, 15 (05):
  • [7] Visual Attention based Bag-of-Words Model for Image Classification
    Wang, Qiwei
    Wan, Shouhong
    Yue, Lihua
    Wang, Che
    [J]. 6TH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2014), 2014, 9159
  • [8] How to use Bag-of-Words model better for image classification
    Wang, Chong
    Huang, Kaiqi
    [J]. IMAGE AND VISION COMPUTING, 2015, 38 : 65 - 74
  • [9] Fusion of Bag-of-Words Models for Image Classification in the Medical Domain
    Valavanis, Leonidas
    Stathopoulos, Spyridon
    Kalamboukis, Theodore
    [J]. ADVANCES IN INFORMATION RETRIEVAL, ECIR 2017, 2017, 10193 : 134 - 145
  • [10] A Novel Image Classification Method Based on Bag-of-Words Framework
    Liu, Yi
    Yu, Ming
    Xue, Cuihong
    Yang, Yueqiang
    [J]. 2018 IEEE 8TH ANNUAL INTERNATIONAL CONFERENCE ON CYBER TECHNOLOGY IN AUTOMATION, CONTROL, AND INTELLIGENT SYSTEMS (IEEE-CYBER), 2018, : 534 - 539