Two-Stage Hashing for Fast Document Retrieval

被引:0
|
作者
Li, Hao [1 ]
Liu, Wei [2 ]
Ji, Heng [1 ]
机构
[1] Rensselaer Polytech Inst, Dept Comp Sci, Troy, NY 12180 USA
[2] IBM TJ Watson Res Ctr, Yorktown Hts, NY USA
关键词
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
This work fulfills sublinear time Nearest Neighbor Search (NNS) in massive-scale document collections. The primary contribution is to propose a two-stage unsupervised hashing framework which harmoniously integrates two state-of-the-art hashing algorithms Locality Sensitive Hashing (LSH) and Iterative Quantization (ITQ). LSH accounts for neighbor candidate pruning, while ITQ provides an efficient and effective reranking over the neighbor pool captured by LSH. Furthermore, the proposed hashing framework capitalizes on both term and topic similarity among documents, leading to precise document retrieval. The experimental results convincingly show that our hashing based document retrieval approach well approximates the conventional Information Retrieval (IR) method in terms of retrieving semantically similar documents, and meanwhile achieves a speedup of over one order of magnitude in query time.
引用
下载
收藏
页码:495 / 500
页数:6
相关论文
共 50 条
  • [41] Two-stage semantic matching for cross-media retrieval
    Xu G.
    Xu L.
    Zhang M.
    Li X.
    International Journal of Performability Engineering, 2018, 14 (04) : 795 - 804
  • [42] A fast two-stage algorithm for realizing matching pursuit
    Cheung, KP
    Chan, YH
    2001 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL II, PROCEEDINGS, 2001, : 431 - 434
  • [43] TWO-STAGE SENTENCE SELECTION APPROACH FOR MULTI-DOCUMENT SUMMARIZATION
    Zhang Shu Zhao Tiejun Zheng Dequan Zhao Hua (Department of Computer Science and Technology
    Journal of Electronics(China), 2008, (04) : 562 - 567
  • [44] Two-stage generative adversarial networks for binarization of color document images
    Suh, Sungho
    Kim, Jihun
    Lukowicz, Paul
    Lee, Yong Oh
    PATTERN RECOGNITION, 2022, 130
  • [45] A novel deep hashing method for fast image retrieval
    Cheng, Shuli
    Lai, Huicheng
    Wang, Liejun
    Qin, Jiwei
    VISUAL COMPUTER, 2019, 35 (09): : 1255 - 1266
  • [46] A novel deep hashing method for fast image retrieval
    Shuli Cheng
    Huicheng Lai
    Liejun Wang
    Jiwei Qin
    The Visual Computer, 2019, 35 : 1255 - 1266
  • [47] Deep Highly Interrelated Hashing for Fast Image Retrieval
    He Z.
    Feng X.
    Liu L.
    Huang Q.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2020, 57 (11): : 2375 - 2388
  • [48] Hierarchical deep semantic hashing for fast image retrieval
    Ou, Xinyu
    Ling, Hefei
    Liu, Si
    Lei, Jie
    MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (20) : 21281 - 21302
  • [49] Fast Unmediated Hashing for Cross-Modal Retrieval
    Nie, Xiushan
    Liu, Xingbo
    Xi, Xiaoming
    Li, Chenglong
    Yin, Yilong
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (09) : 3669 - 3678
  • [50] Deep binary constraint hashing for fast image retrieval
    Li, Yang
    Miao, Zhuang
    Wang, Jiabao
    Zhang, Yafei
    ELECTRONICS LETTERS, 2018, 54 (01) : 25 - 26