Large-scale near-duplicate image retrieval by kernel density estimation

被引:0
|
作者
Tong, Wei [1 ]
Li, Fengjie [2 ]
Jin, Rong [2 ]
Jain, Anil [2 ]
机构
[1] Carnegie Mellon Univ, Language Technol Inst, Pittsburgh, PA 15213 USA
[2] Michigan State Univ, Dept Comp Sci & Engn, E Lansing, MI 48824 USA
关键词
Content-based image retrieval; Near-duplicate image retrieval; Kernel density estimation; Bag-of-words model;
D O I
10.1007/s13735-012-0012-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Bag-of-words model is one of the most widely used methods in the recent studies of multimedia data retrieval. The key idea of the bag-of-words model is to quantize the bag of local features, for example SIFT, to a histogram of visual words and then standard information retrieval technologies developed from text retrieval can be applied directly. Despite its success, one problem of the bag-of-words model is that the two key steps, i.e., feature quantization and retrieval, are separated. In other words, the step of generating bag-of-words representation is not optimized for the step of retrieval which often leads to a sub-optimal performance. In this paper we propose a statistical framework for large-scale near-duplication image retrieval which unifies the two steps by introducing kernel density function. The central idea of the proposed method is to represent each image by a kernel density function and the similarity between the query image and a database image is then estimated as the query likelihood. In order to make the proposed method applicable to large-scale data sets, we have developed efficient algorithms for both estimating the density function of each image and computing the query likelihood. Our empirical studies confirm that the proposed method is not only more effective but also more efficient than the bag-of-words model.
引用
收藏
页码:45 / 58
页数:14
相关论文
共 50 条
  • [1] Large-scale near-duplicate image retrieval by kernel density estimation
    Wei Tong
    Fengjie Li
    Rong Jin
    Anil Jain
    International Journal of Multimedia Information Retrieval, 2012, 1 (1) : 45 - 58
  • [2] Large Scale Near-duplicate Image Retrieval via Patch Embedding
    Yan, Shangpeng
    Zhang, Xiaoyun
    Bao, Wenbo
    Chen, Li
    Gao, Zhiyong
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 2972 - 2979
  • [3] Large-Scale Near-Duplicate Web Video Retrieval: Challenges and Approaches
    Cai, Yang
    Yang, Linjun
    IEEE MULTIMEDIA, 2013, 20 (02) : 42 - 51
  • [4] Stochastic Multiview Hashing for Large-Scale Near-Duplicate Video Retrieval
    Hao, Yanbin
    Mu, Tingting
    Hong, Richang
    Wang, Meng
    An, Ning
    Goulermas, John Y.
    IEEE TRANSACTIONS ON MULTIMEDIA, 2017, 19 (01) : 1 - 14
  • [5] Advance on large scale near-duplicate video retrieval
    Shen, Ling
    Hong, Richang
    Hao, Yanbin
    FRONTIERS OF COMPUTER SCIENCE, 2020, 14 (05)
  • [6] Advance on large scale near-duplicate video retrieval
    Ling Shen
    Richang Hong
    Yanbin Hao
    Frontiers of Computer Science, 2020, 14
  • [7] GPU-based MapReduce for large-scale near-duplicate video retrieval
    Wang, Hanli
    Zhu, Fengkuangtian
    Xiao, Bo
    Wang, Lei
    Jiang, Yu-Gang
    MULTIMEDIA TOOLS AND APPLICATIONS, 2015, 74 (23) : 10515 - 10534
  • [8] SVD: A Large-Scale Short Video Dataset for Near-Duplicate Video Retrieval
    Jiang, Qing-Yuan
    He, Yi
    Li, Gen
    Lin, Jian
    Li, Lei
    Li, Wu-Jun
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 5280 - 5288
  • [9] Effective Multiple Feature Hashing for Large-Scale Near-Duplicate Video Retrieval
    Song, Jingkuan
    Yang, Yi
    Huang, Zi
    Shen, Heng Tao
    Luo, Jiebo
    IEEE TRANSACTIONS ON MULTIMEDIA, 2013, 15 (08) : 1997 - 2008
  • [10] GPU-based MapReduce for large-scale near-duplicate video retrieval
    Hanli Wang
    Fengkuangtian Zhu
    Bo Xiao
    Lei Wang
    Yu-Gang Jiang
    Multimedia Tools and Applications, 2015, 74 : 10515 - 10534