Lp-Norm IDF for Scalable Image Retrieval

Cited by: 48
Authors
Zheng, Liang [1]
Wang, Shengjin [1]
Tian, Qi [2]
Affiliations
[1] Tsinghua Univ, Dept Elect Engn, State Key Lab Intelligent Technol & Syst, Tsinghua Natl Lab Informat Sci & Technol, Beijing 100084, Peoples R China
[2] Univ Texas San Antonio, Dept Comp Sci, San Antonio, TX 78249 USA
Funding
National High Technology Research and Development Program of China (863 Program); US National Science Foundation
Keywords
Image retrieval; Lp-norm IDF; burstiness; visual word frequency; OBJECT RETRIEVAL; SIMILARITY; VOCABULARY; GEOMETRY; FEATURES; SEARCH; SET;
DOI
10.1109/TIP.2014.2329182
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The inverse document frequency (IDF) is widely used in bag-of-words-based image retrieval. The basic idea is to assign less weight to terms with high frequency, and vice versa. However, in the conventional IDF routine, the estimation of visual word frequency is coarse and heuristic, so its effectiveness is compromised and far from optimal. To address this problem, this paper introduces a novel IDF family based on the Lp-norm pooling technique. The proposed IDF is designed to consider the term frequency (TF), the document frequency, the complexity of images, and the codebook information. We further propose a parameter tuning strategy that helps to balance the TF and pIDF weights, yielding the so-called Lp-norm IDF (pIDF). We show that the conventional IDF is a special case of our generalized version, and that two novel IDFs, i.e., the average IDF and the max IDF, can be defined from the concept of pIDF. Further, by accounting for the term frequency in each image, the proposed pIDF helps to alleviate the visual word burstiness phenomenon. Our method is evaluated through extensive experiments on four benchmark data sets (Oxford 5K, Paris 6K, Holidays, and Ukbench). We show that the pIDF works well on large-scale databases and when the codebook is trained on irrelevant data. We report a mean average precision improvement of up to +13.0% over the baseline TF-IDF approach on a 1M data set. In addition, the pIDF has a wide application scope, ranging from buildings to general objects and scenes. When combined with postprocessing steps, we achieve results competitive with state-of-the-art methods. Finally, since the pIDF is computed offline, it introduces no extra computation or memory cost to the system.
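The abstract does not give the pIDF formula, so the following is only a minimal sketch of the general idea, under a loudly stated assumption: the frequency of a visual word is estimated by the Lp norm of its per-image TF values, and the weight is log(N / frequency). The function name `lp_norm_idf`, its arguments, and this exact form are hypothetical; the normalization for image complexity and codebook information mentioned in the abstract is not modeled.

```python
import math


def lp_norm_idf(tf_rows, p):
    """Return one IDF-style weight per visual word (assumed form, not the paper's).

    tf_rows: list of per-image term-frequency lists (images x words).
    p: 0 counts the images that contain the word (document frequency),
       math.inf takes the largest per-image count, and any other positive
       value uses the Lp norm of the word's TF column.
    """
    n_images = len(tf_rows)
    n_words = len(tf_rows[0])
    weights = []
    for k in range(n_words):
        column = [row[k] for row in tf_rows]
        if p == 0:
            freq = sum(1 for t in column if t > 0)
        elif math.isinf(p):
            freq = max(column)
        else:
            freq = sum(t ** p for t in column) ** (1.0 / p)
        # Words that never occur get weight 0 instead of log(N / 0).
        weights.append(math.log(n_images / freq) if freq > 0 else 0.0)
    return weights


# Toy data: 4 images, 2 visual words; word 0 is bursty in the second image.
tf = [[1, 0], [3, 0], [0, 1], [0, 0]]
idf_p0 = lp_norm_idf(tf, 0)  # document-frequency case
idf_p1 = lp_norm_idf(tf, 1)  # sums the per-image counts
```

In this assumed form, p = 0 reduces to the conventional IDF, which is consistent with the abstract's statement that the conventional IDF is a special case. With p = 1, the bursty word 0 receives a lower weight than with p = 0, which illustrates how using per-image term frequency can dampen burstiness. Whether p = 1 and p = ∞ correspond exactly to the paper's average IDF and max IDF is not confirmed by the abstract.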
Pages: 3604-3617
Number of pages: 14