Self-Taught Hashing for Fast Similarity Search

被引:0
|
作者
Zhang, Dell [1 ]
Wang, Jun [1 ]
Cal, Deng [1 ]
Lu, Jinsong [1 ]
机构
[1] Univ London, DCSIS, London WC1E 7HX, England
关键词
Similarity Search; Semantic Hashing; Laplacian Eigenmap; Support Vector Machine; DIMENSIONALITY;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The ability of fast similarity search at large scale is of great importance to many Information Retrieval (IR) applications. A promising way to accelerate similarity search is semantic hashing which designs compact binary codes for a large number of documents so that semantically similar documents are mapped to similar codes (within a short Hamming distance). Although some recently proposed techniques are able to generate high-quality codes for documents known in advance, obtaining the codes for previously unseen documents remains to be a very challenging problem. In this paper, we emphasise this issue and propose a novel Self-Taught Hashing (STH) approach to semantic hashing: we first find the optimal l-bit binary codes for all documents in the given corpus via unsupervised learning, and then train 1 classifiers via supervised learning to predict the l-bit code for any query document unseen before. Our experiments on three real-world text datasets show that the proposed approach using binarised Laplacian Eigenmap (LapEig) and linear Support Vector Machine (SVM) outperforms state-of-the-art techniques significantly.
引用
收藏
页码:18 / 25
页数:8
相关论文
共 50 条
  • [31] THE SELF-TAUGHT VOCAL INTERFACE
    Gemmeke, Jort F.
    [J]. 2014 4TH JOINT WORKSHOP ON HANDS-FREE SPEECH COMMUNICATION AND MICROPHONE ARRAYS (HSCMA), 2014, : 21 - 22
  • [32] Bayesian Locality Sensitive Hashing for Fast Similarity Search
    Satuluri, Venu
    Parthasarathy, Srinivasan
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2012, 5 (05): : 430 - 441
  • [33] Weighted Hashing for Fast Large Scale Similarity Search
    Wang, Qifan
    Zhang, Dan
    Si, Luo
    [J]. PROCEEDINGS OF THE 22ND ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM'13), 2013, : 1185 - 1188
  • [34] SCHOOLS OF EDUCATION - THE SELF-TAUGHT TEACHER
    HESSLING, P
    DEMPSEY, V
    [J]. TEACHER AS PHILOSOPHER: PROCEEDINGS OF THE THIRTY-THIRD ANNUAL MEETING OF THE SOUTH ATLANTIC PHILOSOPHY OF EDUCATION SOCIETY, 1989, : 119 - 128
  • [35] Self-Taught and Successful: Omur Tokgoz
    Kalay, Leman
    [J]. CERAMICS-ART AND PERCEPTION, 2023, (121) : 98 - 103
  • [36] TEACHER-EDUCATION SELF-TAUGHT
    BECK, CE
    [J]. JOURNAL OF TEACHER EDUCATION, 1963, 14 (01) : 96 - 97
  • [37] OUTSIDERISM: A DISCOURSE ON SELF-TAUGHT ART
    Gomez, Edward M.
    Anderson, Brooke Davis
    [J]. ART IN AMERICA, 2010, 98 (04): : 53 - +
  • [38] Self-taught support vector machines
    Razzaghi, Parvin
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2019, 59 (03) : 685 - 709
  • [39] Haridas Banerjee: A self-taught physicist
    Roy, SM
    [J]. CURRENT SCIENCE, 1997, 72 (05): : 348 - 349
  • [40] SPELLING ABILITY OF A SELF-TAUGHT READER
    GOODMAN, YM
    GOODMAN, KS
    [J]. ELEMENTARY SCHOOL JOURNAL, 1963, 64 (03): : 149 - 154