Efficient fuzzy full-text type-ahead search

被引:0
|
作者
Guoliang Li
Shengyue Ji
Chen Li
Jianhua Feng
机构
[1] Tsinghua University,Department of Computer Science
[2] University of California,Department of Computer Science
来源
The VLDB Journal | 2011年 / 20卷
关键词
Auto complete; Full-text search; Type-ahead search; Fuzzy search;
D O I
暂无
中图分类号
学科分类号
摘要
Traditional information systems return answers after a user submits a complete query. Users often feel “left in the dark” when they have limited knowledge about the underlying data and have to use a try-and-see approach for finding information. A recent trend of supporting autocomplete in these systems is a first step toward solving this problem. In this paper, we study a new information-access paradigm, called “type-ahead search” in which the system searches the underlying data “on the fly” as the user types in query keywords. It extends autocomplete interfaces by allowing keywords to appear at different places in the underlying data. This framework allows users to explore data as they type, even in the presence of minor errors. We study research challenges in this framework for large amounts of data. Since each keystroke of the user could invoke a query on the backend, we need efficient algorithms to process each query within milliseconds. We develop various incremental-search algorithms for both single-keyword queries and multi-keyword queries, using previously computed and cached results in order to achieve a high interactive speed. We develop novel techniques to support fuzzy search by allowing mismatches between query keywords and answers. We have deployed several real prototypes using these techniques. One of them has been deployed to support type-ahead search on the UC Irvine people directory, which has been used regularly and well received by users due to its friendly interface and high efficiency.
引用
收藏
页码:617 / 640
页数:23
相关论文
共 50 条
  • [41] Enhancing HDFS with a full-text search system for massive small files
    Xu, Wentao
    Zhao, Xin
    Lao, Bin
    Nong, Ge
    JOURNAL OF SUPERCOMPUTING, 2021, 77 (07): : 7149 - 7170
  • [42] Enhancing HDFS with a full-text search system for massive small files
    Wentao Xu
    Xin Zhao
    Bin Lao
    Ge Nong
    The Journal of Supercomputing, 2021, 77 : 7149 - 7170
  • [43] One approach for full-text search of files in MongoDB based systems
    Kelec, Aleksandar
    Dujlovic, Igor
    Obradovic, Nikola
    2019 18TH INTERNATIONAL SYMPOSIUM INFOTEH-JAHORINA (INFOTEH), 2019,
  • [44] Improving Bilingual Search Performance Using Compact Full-Text Indices
    Costa, Jorge
    Gomes, Luis
    Lopes, Gabriel P.
    Russo, Luis M. S.
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING (CICLING 2015), PT I, 2015, 9041 : 582 - 595
  • [45] Full-text search engine with suffix index for massive heterogeneous data
    Xu, Wentao
    Chen, Haoyu
    Huan, Yidong
    Hu, Xuedong
    Nong, Ge
    INFORMATION SYSTEMS, 2022, 104
  • [46] TRMeister: a DBMS with high-performance full-text search functions
    Ikeda, T
    Mano, H
    Itoh, H
    Takegawa, H
    Hiraoka, T
    Horibe, S
    Ogawa, Y
    ICDE 2005: 21ST INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2005, : 958 - 967
  • [47] SEARCHING FULL-TEXT DATABASES
    TENOPIR, C
    LIBRARY JOURNAL, 1988, 113 (08) : 60 - 61
  • [48] Full-text searching in Perl
    Kientzle, T
    DR DOBBS JOURNAL, 1999, 24 (01): : 34 - +
  • [49] Fast and Exact Nearest Neighbor Search in Hamming Space on Full-Text Search Engines
    Mu, Cun
    Zhao, Jun
    Yang, Guang
    Yang, Binwei
    Yan, Zheng
    SIMILARITY SEARCH AND APPLICATIONS (SISAP 2019), 2019, 11807 : 49 - 56