Two-Stage Hashing for Fast Document Retrieval

被引:0
|
作者
Li, Hao [1 ]
Liu, Wei [2 ]
Ji, Heng [1 ]
机构
[1] Rensselaer Polytech Inst, Dept Comp Sci, Troy, NY 12180 USA
[2] IBM TJ Watson Res Ctr, Yorktown Hts, NY USA
关键词
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
This work fulfills sublinear time Nearest Neighbor Search (NNS) in massive-scale document collections. The primary contribution is to propose a two-stage unsupervised hashing framework which harmoniously integrates two state-of-the-art hashing algorithms Locality Sensitive Hashing (LSH) and Iterative Quantization (ITQ). LSH accounts for neighbor candidate pruning, while ITQ provides an efficient and effective reranking over the neighbor pool captured by LSH. Furthermore, the proposed hashing framework capitalizes on both term and topic similarity among documents, leading to precise document retrieval. The experimental results convincingly show that our hashing based document retrieval approach well approximates the conventional Information Retrieval (IR) method in terms of retrieving semantically similar documents, and meanwhile achieves a speedup of over one order of magnitude in query time.
引用
下载
收藏
页码:495 / 500
页数:6
相关论文
共 50 条
  • [1] Two-Stage Unsupervised Deep Hashing for Image Retrieval
    Gan, Yuan-Zhu
    Hu, Hao
    Yang, Yu-Bin
    PRICAI 2018: TRENDS IN ARTIFICIAL INTELLIGENCE, PT I, 2018, 11012 : 477 - 489
  • [2] Two-Stage Document Length Normalization for Information Retrieval
    Na, Seung-Hoon
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2015, 33 (02) : 8
  • [3] Two-Stage Supervised Discrete Hashing for Cross-Modal Retrieval
    Zhang, Donglin
    Xiao-Jun Wu
    Xu, Tianyang
    Kittler, Josef
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 52 (11): : 7014 - 7026
  • [4] Margin-based two-stage supervised hashing for image retrieval
    Liu, Ye
    Pan, Yan
    Lai, Hanjiang
    Liu, Cong
    Yin, Jian
    NEUROCOMPUTING, 2016, 214 : 894 - 901
  • [5] Two-Stage Asymmetric Similarity Preserving Hashing for Cross-Modal Retrieval
    Huang, Junfan
    Kang, Peipei
    Han, Na
    Chen, Yonghao
    Fang, Xiaozhao
    Gao, Hongbo
    Zhou, Guoxu
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (01) : 429 - 444
  • [6] Two-stage zero-shot sparse hashing with missing labels for cross-modal retrieval
    Yong, Kailing
    Shu, Zhenqiu
    Wang, Hongbin
    Yu, Zhengtao
    PATTERN RECOGNITION, 2024, 155
  • [7] A new fast image retrieval using the condensed two-stage search method
    Cho, JW
    Jeong, SD
    Lee, GS
    Cho, SH
    Choi, BU
    IEICE TRANSACTIONS ON COMMUNICATIONS, 2003, E86B (12) : 3658 - 3661
  • [8] A two-stage framework for polygon retrieval
    Tung, LH
    King, I
    MULTIMEDIA TOOLS AND APPLICATIONS, 2000, 11 (02) : 235 - 255
  • [9] A Two-Stage Framework for Polygon Retrieval
    Lun Hsing Tung
    Irwin King
    Multimedia Tools and Applications, 2000, 11 : 235 - 255
  • [10] Hidden semantic hashing for fast retrieval over large scale document collection
    Fuhao Zou
    Xiaoman Tang
    Kai Li
    Yunfei Wang
    Jingkuan Song
    Shuangyuan Yang
    Hefei Ling
    Multimedia Tools and Applications, 2018, 77 : 3677 - 3697