Data-Dependent Hashing Based on p-Stable Distribution

Cited by: 39
Authors
Bai, Xiao [1]
Yang, Haichuan [1]
Zhou, Jun [2]
Ren, Peng [3]
Cheng, Jian [4]
Affiliations
[1] Beihang Univ, Sch Comp Sci & Engn, Beijing 100191, Peoples R China
[2] Griffith Univ, Sch Informat & Commun Technol, Nathan, Qld 4111, Australia
[3] China Univ Petr, Coll Informat & Control Engn, Qingdao 257061, Peoples R China
[4] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China
Funding
Australian Research Council
Keywords
Image retrieval; hash retrieval; p-stable distribution; quantization
DOI
10.1109/TIP.2014.2352458
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
The p-stable distribution is traditionally used for data-independent hashing. In this paper, we describe how to perform data-dependent hashing based on the p-stable distribution. We begin by formulating the Euclidean-distance-preserving property in terms of variance estimation. Based on this property, we develop a projection method that maps the original data to vectors of arbitrary dimension. Each projection vector is a linear combination of multiple random vectors drawn from a p-stable distribution, and the weights of the linear combination are learned from the training data. An orthogonal matrix is then learned from the data to minimize the thresholding error in quantization. Combining the projection method and the orthogonal matrix, we develop an unsupervised hashing scheme that preserves the Euclidean distance. Compared with data-independent hashing methods, our method takes the data distribution into account and gives more accurate hashing results with compact hash codes. Unlike many data-dependent hashing methods, our method accommodates multiple hash tables and is not restricted in the number of hash functions. To extend our method to a supervised scenario, we incorporate a supervised label propagation scheme into the proposed projection method. This results in a supervised hashing scheme that preserves the semantic similarity of the data. Experimental results show that our methods outperform several state-of-the-art hashing approaches in both effectiveness and efficiency.
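For orientation, the data-independent baseline mentioned in the abstract's first sentence can be sketched as follows. This is a minimal illustration of classical 2-stable (Gaussian) hashing, not the learned method the paper proposes: it omits the learned combination weights and the learned orthogonal matrix. The function names, the bucket width `w`, and the toy data are all assumptions made for this example. The first part illustrates the variance view of distance preservation: for a vector `a` with i.i.d. N(0, 1) entries, the projection `a·(x − y)` is Gaussian with variance `||x − y||²`, so the mean squared projection estimates the squared Euclidean distance.

```python
import numpy as np

def gaussian_vectors(n_vec, dim, seed):
    """Draw n_vec random vectors with i.i.d. N(0, 1) entries (2-stable)."""
    rng = np.random.default_rng(seed)
    return rng.standard_normal((n_vec, dim))

def estimate_sq_distance(x, y, proj):
    """Estimate ||x - y||^2 as the mean squared projection of (x - y)."""
    diffs = proj @ (x - y)
    return float(np.mean(diffs ** 2))

def pstable_hash(x, proj, offsets, w):
    """Classical bucket hash: floor((a . x + b) / w) for each row a of proj."""
    return np.floor((proj @ x + offsets) / w).astype(np.int64)

# Toy data (assumed): a point, a near neighbour, and an unrelated point.
rng = np.random.default_rng(1)
dim = 32
x = rng.standard_normal(dim)
y = x + 0.1 * rng.standard_normal(dim)
z = rng.standard_normal(dim)

# Variance estimation of the squared Euclidean distance.
many = gaussian_vectors(20000, dim, seed=0)
true_sq = float(np.sum((x - y) ** 2))
est_sq = estimate_sq_distance(x, y, many)

# Data-independent hashing with 256 hash functions; w = 4.0 is an arbitrary choice.
w = 4.0
proj = gaussian_vectors(256, dim, seed=2)
offsets = np.random.default_rng(3).uniform(0.0, w, size=256)
hx, hy, hz = (pstable_hash(v, proj, offsets, w) for v in (x, y, z))

# Near neighbours should share a bucket under more hash functions.
coll_near = int(np.sum(hx == hy))
coll_far = int(np.sum(hx == hz))
print(f"true={true_sq:.4f} est={est_sq:.4f} near={coll_near} far={coll_far}")
```

In this baseline the projection vectors are fixed at random and never see the data. According to the abstract, the paper's contribution is to learn the weights that combine such random vectors, and an orthogonal matrix for quantization, from training data.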
Pages: 5033-5046
Page count: 14