Data-Dependent Hashing Based on p-Stable Distribution

被引:39
|
作者
Bai, Xiao [1 ]
Yang, Haichuan [1 ]
Zhou, Jun [2 ]
Ren, Peng [3 ]
Cheng, Jian [4 ]
机构
[1] Beihang Univ, Sch Comp Sci & Engn, Beijing 100191, Peoples R China
[2] Griffith Univ, Sch Informat & Commun Technol, Nathan, Qld 4111, Australia
[3] China Univ Petr, Coll Informat & Control Engn, Qingdao 257061, Peoples R China
[4] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China
基金
澳大利亚研究理事会;
关键词
Image retrieval; hash retrieval; p-stable distribution; QUANTIZATION;
D O I
10.1109/TIP.2014.2352458
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The p-stable distribution is traditionally used for data-independent hashing. In this paper, we describe how to perform data-dependent hashing based on p-stable distribution. We commence by formulating the Euclidean distance preserving property in terms of variance estimation. Based on this property, we develop a projection method, which maps the original data to arbitrary dimensional vectors. Each projection vector is a linear combination of multiple random vectors subject to p-stable distribution, in which the weights for the linear combination are learned based on the training data. An orthogonal matrix is then learned data-dependently for minimizing the thresholding error in quantization. Combining the projection method and orthogonal matrix, we develop an unsupervised hashing scheme, which preserves the Euclidean distance. Compared with data-independent hashing methods, our method takes the data distribution into consideration and gives more accurate hashing results with compact hash codes. Different from many data-dependent hashing methods, our method accommodates multiple hash tables and is not restricted by the number of hash functions. To extend our method to a supervised scenario, we incorporate a supervised label propagation scheme into the proposed projection method. This results in a supervised hashing scheme, which preserves semantic similarity of data. Experimental results show that our methods have outperformed several state-of-the-art hashing approaches in both effectiveness and efficiency.
引用
下载
收藏
页码:5033 / 5046
页数:14
相关论文
共 50 条
  • [21] P-STABLE MODIFICATION OF THE NUMEROV METHOD
    SHAH, SA
    COMPUTERS & MATHEMATICS WITH APPLICATIONS, 1988, 16 (12) : 1041 - 1043
  • [22] P-stable abstractions of hybrid systems
    Anna Becchi
    Alessandro Cimatti
    Enea Zaffanella
    Software and Systems Modeling, 2024, 23 : 403 - 426
  • [23] P-stable abstractions of hybrid systems
    Becchi, Anna
    Cimatti, Alessandro
    Zaffanella, Enea
    SOFTWARE AND SYSTEMS MODELING, 2024, 23 (02): : 403 - 426
  • [24] Betti numbers for p-stable ideals
    Ene, V
    Pfister, G
    Popescu, D
    COMMUNICATIONS IN ALGEBRA, 2000, 28 (03) : 1515 - 1531
  • [25] ON SUBSETS OF LP AND P-STABLE PROCESSES
    TALAGRAND, M
    ANNALES DE L INSTITUT HENRI POINCARE-PROBABILITES ET STATISTIQUES, 1989, 25 (02): : 153 - 166
  • [26] Data-Dependent QoS-Based Service Selection
    Jain, Navati
    Ding, Chen
    Liu, Xumin
    SERVICE-ORIENTED COMPUTING, (ICSOC 2016), 2016, 9936 : 617 - 625
  • [27] P-stable models of strong kernel programs
    Zepeda, Claudia
    Luis Carballido, Jose
    JOURNAL OF ALGORITHMS-COGNITION INFORMATICS AND LOGIC, 2009, 64 (01): : 41 - 50
  • [28] A new characteristic subgroup of a p-stable group
    Glauberman, George
    Solomon, Ronald
    JOURNAL OF ALGEBRA, 2012, 368 : 231 - 236
  • [29] Hashing Based Data Distribution in Heterogeneous Storage
    Zhou, Jiang
    Su, Lin
    Wang, Weiping
    Chen, Yong
    19TH IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING WITH APPLICATIONS (ISPA/BDCLOUD/SOCIALCOM/SUSTAINCOM 2021), 2021, : 652 - 659
  • [30] Variations of Frechet measures of p-stable motions
    Blei, RC
    Towghi, N
    STATISTICS & PROBABILITY LETTERS, 2002, 56 (04) : 355 - 360