Semantics-Preserving Bag-of-Words Models and Applications

被引:141
|
作者
Wu, Lei [1 ]
Hoi, Steven C. H. [2 ]
Yu, Nenghai [1 ]
机构
[1] Univ Sci & Technol China, Dept EEIS, MOE MS Key Lab MCC, Hefei 230026, Peoples R China
[2] Nanyang Technol Univ, Sch Comp Engn, Singapore 639798, Singapore
基金
国家高技术研究发展计划(863计划);
关键词
Bag-of-words models; distance metric learning; image annotation; object representation; semantic gap;
D O I
10.1109/TIP.2010.2045169
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The Bag-of-Words (BoW) model is a promising image representation technique for image categorization and annotation tasks. One critical limitation of existing BoW models is that much semantic information is lost during the codebook generation process, an important step of BoW. This is because the codebook generated by BoW is often obtained via building the codebook simply by clustering visual features in Euclidian space. However, visual features related to the same semantics may not distribute in clusters in the Euclidian space, which is primarily due to the semantic gap between low-level features and high-level semantics. In this paper, we propose a novel scheme to learn optimized BoW models, which aims to map semantically related features to the same visual words. In particular, we consider the distance between semantically identical features as a measurement of the semantic gap, and attempt to learn an optimized codebook by minimizing this gap, aiming to achieve the minimal loss of the semantics. We refer to such kind of novel codebook as semantics- preserving codebook (SPC) and the corresponding model as the Semantics-Preserving Bag-of-Words (SPBoW) model. Extensive experiments on image annotation and object detection tasks with public testbeds from MIT's Labelme and PASCAL VOC challenge databases show that the proposed SPC learning scheme is effective for optimizing the codebook generation process, and the SPBoW model is able to greatly enhance the performance of the existing BoW model.
引用
收藏
页码:1908 / 1920
页数:13
相关论文
共 50 条
  • [1] Enhancing Bag-of-Words Models with Semantics-Preserving Metric Learning
    Wu, Lei
    Hoi, Steven C. H.
    [J]. IEEE MULTIMEDIA, 2011, 18 (01) : 24 - 37
  • [2] ECG Biometrics Using Bag-of-Words Models
    Ciocoiu, Iulian B.
    [J]. 2015 INTERNATIONAL SYMPOSIUM ON SIGNALS, CIRCUITS AND SYSTEMS (ISSCS), 2015,
  • [3] Incorporating Temporal Context in Bag-of-Words Models
    Glaser, Tamar
    Zelnik-Manor, Lihi
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCV WORKSHOPS), 2011,
  • [4] Semantics-Preserving Fusion of Structures of Probabilistic Graphical Models
    Yue, Kun
    Zhu, Yunlei
    Tian, Kailin
    Liu, Weiyi
    [J]. FOUNDATIONS OF INTELLIGENT SYSTEMS (ISKE 2011), 2011, 122 : 63 - +
  • [5] Supervised Learning and Codebook Optimization for Bag-of-Words Models
    Jiu, Mingyuan
    Wolf, Christian
    Garcia, Christophe
    Baskurt, Atilla
    [J]. COGNITIVE COMPUTATION, 2012, 4 (04) : 409 - 419
  • [6] Supervised Learning and Codebook Optimization for Bag-of-Words Models
    Mingyuan Jiu
    Christian Wolf
    Christophe Garcia
    Atilla Baskurt
    [J]. Cognitive Computation, 2012, 4 : 409 - 419
  • [7] Semantics-Preserving XML Query
    Song, Jiehu
    Lin, Zuoquan
    [J]. 2020 5TH INTERNATIONAL CONFERENCE ON MATHEMATICS AND ARTIFICIAL INTELLIGENCE (ICMAI 2020), 2020, : 85 - 92
  • [8] Accelerating Bag-of-Words with SOM
    Chen, Jian-Hui
    Wang, Zuo-Ren
    Liu, Cheng-Lin
    [J]. NEURAL INFORMATION PROCESSING (ICONIP 2019), PT III, 2019, 11955 : 573 - 584
  • [9] Semantic Bag-of-Words Models for Visual Concept Detection and Annotation
    Zhang, Yu
    Bres, Stphane
    Chen, Liming
    [J]. 8TH INTERNATIONAL CONFERENCE ON SIGNAL IMAGE TECHNOLOGY & INTERNET BASED SYSTEMS (SITIS 2012), 2012, : 289 - 295
  • [10] Fusion of Bag-of-Words Models for Image Classification in the Medical Domain
    Valavanis, Leonidas
    Stathopoulos, Spyridon
    Kalamboukis, Theodore
    [J]. ADVANCES IN INFORMATION RETRIEVAL, ECIR 2017, 2017, 10193 : 134 - 145