Self-Taught Hashing for Fast Similarity Search

被引:0
|
作者
Zhang, Dell [1 ]
Wang, Jun [1 ]
Cal, Deng [1 ]
Lu, Jinsong [1 ]
机构
[1] Univ London, DCSIS, London WC1E 7HX, England
关键词
Similarity Search; Semantic Hashing; Laplacian Eigenmap; Support Vector Machine; DIMENSIONALITY;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The ability of fast similarity search at large scale is of great importance to many Information Retrieval (IR) applications. A promising way to accelerate similarity search is semantic hashing which designs compact binary codes for a large number of documents so that semantically similar documents are mapped to similar codes (within a short Hamming distance). Although some recently proposed techniques are able to generate high-quality codes for documents known in advance, obtaining the codes for previously unseen documents remains to be a very challenging problem. In this paper, we emphasise this issue and propose a novel Self-Taught Hashing (STH) approach to semantic hashing: we first find the optimal l-bit binary codes for all documents in the given corpus via unsupervised learning, and then train 1 classifiers via supervised learning to predict the l-bit code for any query document unseen before. Our experiments on three real-world text datasets show that the proposed approach using binarised Laplacian Eigenmap (LapEig) and linear Support Vector Machine (SVM) outperforms state-of-the-art techniques significantly.
引用
收藏
页码:18 / 25
页数:8
相关论文
共 50 条
  • [21] LAMBERT - SELF-TAUGHT PHYSICIST
    JAKI, SL
    [J]. PHYSICS TODAY, 1977, 30 (09) : 25 - &
  • [22] Self-taught soft skills
    Alexandra Lucs
    [J]. Nature, 2014, 506 (7487) : 257 - 257
  • [23] The self-taught vocal interface
    Bart Ons
    Jort F Gemmeke
    Hugo Van hamme
    [J]. EURASIP Journal on Audio, Speech, and Music Processing, 2014
  • [24] Tamil Grammar Self-Taught
    不详
    [J]. SCOTTISH GEOGRAPHICAL MAGAZINE, 1907, 23 (05): : 272 - 272
  • [25] Egyptian Self-Taught (Arabic)
    不详
    [J]. SCOTTISH GEOGRAPHICAL MAGAZINE, 1910, 26 (03): : 163 - 163
  • [26] FILM MAKERS, SELF-TAUGHT
    PANEY, H
    [J]. INDUSTRIAL PHOTOGRAPHY, 1969, 18 (09): : 10 - &
  • [27] ON BEING A SELF-TAUGHT COMPOSER
    HUDES, E
    [J]. COMPOSER, 1980, (71): : 17 - 20
  • [28] THE SELF-TAUGHT TEACHER - RESPONSE
    CONGLETON, J
    [J]. TEACHER AS PHILOSOPHER: PROCEEDINGS OF THE THIRTY-THIRD ANNUAL MEETING OF THE SOUTH ATLANTIC PHILOSOPHY OF EDUCATION SOCIETY, 1989, : 129 - 132
  • [29] The self-taught vocal interface
    Ons, Bart
    Gemmeke, Jort F.
    Van hamme, Hugo
    [J]. EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2014, : 1 - 16
  • [30] Portrait of the writer self-taught
    Biron, Michel
    [J]. ANALYSES-REVUE DE CRITIQUE ET DE THEORIE LITTERAIRE, 2007, 2 (03):