Image classification by visual bag-of-words refinement and reduction

Cited by: 19
Authors
Lu, Zhiwu [1]
Wang, Liwei [2]
Wen, Ji-Rong [1]
Affiliations
[1] Renmin Univ China, Sch Informat, Beijing 100872, Peoples R China
[2] Peking Univ, Sch EECS, Key Lab Machine Percept MOE, Beijing 100871, Peoples R China
Funding
Beijing Natural Science Foundation; National Natural Science Foundation of China
Keywords
Image classification; Visual BOW refinement; Visual BOW reduction; Graph-based method; Semantic spectral clustering; Dimensionality reduction; Framework; Scene
DOI
10.1016/j.neucom.2015.01.098
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper presents a new framework for visual bag-of-words (BOW) refinement and reduction that addresses the drawbacks of the visual BOW model, which has been widely used for image classification. Although very influential in the literature, the traditional visual BOW model has two distinct drawbacks. First, for efficiency, the visual vocabulary is commonly constructed by directly clustering the low-level visual feature vectors extracted from local keypoints, without considering the high-level semantics of images. That is, the visual BOW model still suffers from the semantic gap, which may lead to significant performance degradation in more challenging tasks (e.g. social image classification). Second, thousands of visual words are typically generated to obtain better performance on a relatively large image dataset. With such a large vocabulary, the subsequent image classification can take a considerable amount of time. To overcome the first drawback, we develop a graph-based method for visual BOW refinement that exploits the tags (easy to access, although noisy) of social images. More notably, for efficient image classification, we further reduce the refined visual BOW model to a much smaller size through semantic spectral clustering. Extensive experimental results show the promising performance of the proposed framework for visual BOW refinement and reduction. (C) 2015 Elsevier B.V. All rights reserved.
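The reduction step described in the abstract can be illustrated with a minimal sketch, which is not the authors' method. It assumes plain normalized spectral clustering over a word-by-word cosine affinity computed from co-occurrence profiles, whereas the paper clusters a tag-refined model whose details the abstract does not give. The function names `kmeans` and `reduce_bow`, the affinity choice, and the random toy histograms are all hypothetical and chosen only for illustration.

```python
import numpy as np


def kmeans(points, k, n_iter=50, seed=0):
    """Very small k-means; returns one cluster label per row of `points`."""
    rng = np.random.default_rng(seed)
    centers = points[rng.choice(len(points), size=k, replace=False)]
    for _ in range(n_iter):
        d2 = ((points[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = d2.argmin(axis=1)
        for j in range(k):
            members = points[labels == j]
            if len(members) > 0:  # keep the old center if a cluster empties
                centers[j] = members.mean(axis=0)
    return labels


def reduce_bow(hists, n_groups, seed=0):
    """Merge visual words into `n_groups` groups via spectral clustering.

    hists: (n_images, n_words) array of non-negative visual-word counts.
    Returns the reduced (n_images, n_groups) histograms and the word labels.
    """
    n_words = hists.shape[1]
    # Assumption: describe each word by its occurrence profile across images.
    profiles = hists.T
    unit = profiles / np.maximum(
        np.linalg.norm(profiles, axis=1, keepdims=True), 1e-12)
    affinity = unit @ unit.T  # cosine similarity between words
    np.fill_diagonal(affinity, 0.0)
    # Symmetrically normalized affinity D^{-1/2} W D^{-1/2}.
    inv_sqrt = 1.0 / np.sqrt(np.maximum(affinity.sum(axis=1), 1e-12))
    normalized = inv_sqrt[:, None] * affinity * inv_sqrt[None, :]
    # eigh sorts eigenvalues in ascending order; keep the largest n_groups.
    _, vecs = np.linalg.eigh(normalized)
    embedding = vecs[:, -n_groups:]
    embedding = embedding / np.maximum(
        np.linalg.norm(embedding, axis=1, keepdims=True), 1e-12)
    labels = kmeans(embedding, n_groups, seed=seed)
    # Sum the counts of all words assigned to the same group.
    assignment = np.zeros((n_words, n_groups))
    assignment[np.arange(n_words), labels] = 1.0
    return hists @ assignment, labels


# Toy data: 20 images, a 12-word vocabulary reduced to 4 word groups.
rng = np.random.default_rng(0)
hists = rng.integers(0, 5, size=(20, 12)).astype(float)
reduced, labels = reduce_bow(hists, n_groups=4)
print(reduced.shape)  # (20, 4)
```

Because every word is assigned to exactly one group, the total count per image is preserved while the feature dimension shrinks, which is the property that makes the downstream classifier cheaper to train and apply.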
Pages: 373-384
Number of pages: 12