Entropy Optimized Feature-Based Bag-of-Words Representation for Information Retrieval

被引:39
|
作者
Passalis, Nikolaos [1 ]
Tefas, Anastasios [1 ]
机构
[1] Aristotle Univ Thessaloniki, Dept Informat, Thessaloniki 54124, Greece
关键词
Information search and retrieval; dictionary learning; entropy optimization; image retrieval; time-series retrieval; IMAGE RETRIEVAL; RELEVANCE-FEEDBACK;
D O I
10.1109/TKDE.2016.2545657
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present a supervised dictionary learning method for optimizing the feature-based Bag-of-Words (BoW) representation towards Information Retrieval. Following the cluster hypothesis, which states that points in the same cluster are likely to fulfill the same information need, we propose the use of an entropy-based optimization criterion that is better suited for retrieval instead of classification. We demonstrate the ability of the proposed method, abbreviated as EO-BoW, to improve the retrieval performance by providing extensive experiments on two multi-class image datasets. The BoW model can be applied to other domains as well, so we also evaluate our approach using a collection of 45 time-series datasets, a text dataset, and a video dataset. The gains are three-fold since the EO-BoW can improve the mean Average Precision, while reducing the encoding time and the database storage requirements. Finally, we provide evidence that the EO-BoW maintains its representation ability even when used to retrieve objects from classes that were not seen during the training.
引用
收藏
页码:1664 / 1677
页数:14
相关论文
共 50 条
  • [41] A fast, feature-based cluster algorithm for information retrieval
    Mehlitz, Martin
    Bauckhage, Christian
    Albayrak, Sahin
    [J]. IRI 2007: PROCEEDINGS OF THE 2007 IEEE INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION, 2007, : 335 - +
  • [42] Texture Classification Using Scale Invariant Feature Transform and Bag-of-Words
    Budak, Umit
    Sengur, Abdulkadir
    [J]. 2015 23RD SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2015, : 152 - 155
  • [43] BAG-OF-WORDS REPRESENTATION FOR NON-INTRUSIVE SPEECH QUALITY ASSESSMENT
    Li, Qiaohong
    Lin, Weisi
    Fang, Yuming
    Thalmann, Daniel
    [J]. 2015 IEEE CHINA SUMMIT & INTERNATIONAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING, 2015, : 616 - 619
  • [44] Concept Based Representations as Complement of Bag of Words in Information Retrieval
    Carrillo, Maya
    Lopez-Lopez, Aurelio
    [J]. ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, 2010, 339 : 154 - 161
  • [45] Bag-of-Words Vector Quantization Based Face Identification
    Liu, Di
    Sun, Dong-mei
    Qiu, Zheng-ding
    [J]. PROCEEDINGS OF THE SECOND INTERNATIONAL SYMPOSIUM ON ELECTRONIC COMMERCE AND SECURITY, VOL II, 2009, : 29 - 33
  • [46] Improving Bag-Of-Words: Capturing Local Information for Motion-Based Activity Recognition
    Zeng, Ming
    Yu, Tong
    Mengshoel, Ole J.
    Qin, Helen
    Lee, Chris
    Shen, John Paul
    [J]. PROCEEDINGS OF THE 2018 ACM INTERNATIONAL JOINT CONFERENCE ON PERVASIVE AND UBIQUITOUS COMPUTING AND PROCEEDINGS OF THE 2018 ACM INTERNATIONAL SYMPOSIUM ON WEARABLE COMPUTERS (UBICOMP/ISWC'18 ADJUNCT), 2018, : 1345 - 1354
  • [47] Internet Traffic Classification based on bag-of-words model
    Zhang, Yin
    Zhou, Yi
    Chen, Kai
    [J]. 2012 IEEE GLOBECOM WORKSHOPS (GC WKSHPS), 2012, : 736 - 741
  • [48] A Novel Codebook Representation Method and Encoding Strategy For Bag-of-Words Based Acoustic Event Classification
    Dai, Jia
    Ni, Chongjia
    Xue, Wei
    Liu, Wenju
    [J]. 2015 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2015, : 31 - 34
  • [49] SAR Target Discrimination Algorithm Based on Bag-of-words Model with Multi-feature Fusion
    [J]. Wang, Yinghua (yhwang@xidian.edu.cn), 1600, Science Press (39):
  • [50] Saliency map driven image retrieval combining the bag-of-words model and PLSA
    Giouvanakis, Emmanouil
    Kotropoulos, Constantine
    [J]. 2014 19TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2014, : 280 - 285