Efficient method for image classification based on low-scale bag of word model

被引:0
|
作者
Xiao Z. [1 ]
Qin Z.-G. [1 ]
Ding Y. [1 ]
Lan T. [1 ]
Yu Y. [1 ]
机构
[1] School of Information and Software Engineering, University of Electronic Science and Technology of China, Chengdu
关键词
Bag-of-words; Computer vision; Image classification; Scale invariant feture transform (SIFT); Wavelet transform;
D O I
10.3969/j.issn.1001-0548.2016.06.021
中图分类号
学科分类号
摘要
This paper proposes a new framework to improve the efficiency of visual bag-of-words model in large scale image classification. The method is based on the low scale image representation obtained by wavelet transform, and the low scale visual dictionary is built by extracting the SIFT features on the low scale image. Since the feature dimension is reduced, the method can quickly generate the visual dictionary and minimize the time of image classification process. The results of comparison experiments on the 8677 images of Caltech 101 show that the proposed method can effectively improve the classification performance and efficiency of the traditional visual bag-of-words model and the Pyramid-BOW model. © 2016, Editorial Board of Journal of the University of Electronic Science and Technology of China. All right reserved.
引用
收藏
页码:997 / 1001
页数:4
相关论文
共 20 条
  • [1] Joachims T., Text Categorization with Support Vector Machines: Learning with Many Relevant Features, (1998)
  • [2] Sivic J., Zisserman A., Video google: a text retrieval approach to object matching in videos, Ninth IEEE International Conference on Proceedings of the Computer Vision, (2003)
  • [3] Fei-Fei L., Perona P., A bayesian hierarchical model for learning natural scene categories, Proceedings of the Computer Vision and Pattern Recognition, pp. 524-531, (2005)
  • [4] Fergus R., Fei-Fei L., Perona P., Et al., Learning object categories from Google's image search, Tenth IEEE International Conference on Proceedings of the Computer Vision, pp. 1816-1823, (2005)
  • [5] Sudderth E.B., Torralba A., Freeman W.T., Et al., Learning hierarchical models of scenes, objects, and parts, Tenth IEEE International Conference on Proceedings of the Computer Vision, pp. 1331-1338, (2005)
  • [6] Ramesh B., Xiang C., Lee T.H., Shape classification using invariant features and contextual information in the bag-of-words model, Pattern Recognition, 48, 3, pp. 894-906, (2015)
  • [7] Khan R., Barat C., Muselet D., Et al., Spatial histograms of soft pairwise similar patches to improve the bag-of-visual-words model, Computer Vision and Image Understanding, 132, pp. 102-112, (2015)
  • [8] Lei W., Toward a discriminative codebook: Codeword selection across multi-resolution, IEEE Conference on Proceedings of the Computer Vision and Pattern Recognition, pp. 1-8, (2007)
  • [9] Jianchao Y., Kai Y., Yihong G., Et al., Linear spatial pyramid matching using sparse coding for image classification, IEEE Conference on Proceedings of the Computer Vision and Pattern Recognition, pp. 1794-1801, (2009)
  • [10] Jinjun W., Jianchao Y., Kai Y., Et al., Locality-constrained linear coding for image classification, IEEE Conference on Proceedings of the Computer Vision and Pattern Recognition, pp. 3360-3367, (2010)