Efficient fuzzy full-text type-ahead search

被引:0
|
作者
Guoliang Li
Shengyue Ji
Chen Li
Jianhua Feng
机构
[1] Tsinghua University,Department of Computer Science
[2] University of California,Department of Computer Science
来源
The VLDB Journal | 2011年 / 20卷
关键词
Auto complete; Full-text search; Type-ahead search; Fuzzy search;
D O I
暂无
中图分类号
学科分类号
摘要
Traditional information systems return answers after a user submits a complete query. Users often feel “left in the dark” when they have limited knowledge about the underlying data and have to use a try-and-see approach for finding information. A recent trend of supporting autocomplete in these systems is a first step toward solving this problem. In this paper, we study a new information-access paradigm, called “type-ahead search” in which the system searches the underlying data “on the fly” as the user types in query keywords. It extends autocomplete interfaces by allowing keywords to appear at different places in the underlying data. This framework allows users to explore data as they type, even in the presence of minor errors. We study research challenges in this framework for large amounts of data. Since each keystroke of the user could invoke a query on the backend, we need efficient algorithms to process each query within milliseconds. We develop various incremental-search algorithms for both single-keyword queries and multi-keyword queries, using previously computed and cached results in order to achieve a high interactive speed. We develop novel techniques to support fuzzy search by allowing mismatches between query keywords and answers. We have deployed several real prototypes using these techniques. One of them has been deployed to support type-ahead search on the UC Irvine people directory, which has been used regularly and well received by users due to its friendly interface and high efficiency.
引用
收藏
页码:617 / 640
页数:23
相关论文
共 50 条
  • [21] CONCEPTS EXPLICATION OF THE HUMANITIES AND FULL-TEXT SEARCH TOOLS
    Lyapin, Sergey Kh.
    Tolstikova, Irina I.
    PSYCHOLOGY AND PSYCHIATRY, SOCIOLOGY AND HEALTHCARE, EDUCATION, VOL II, 2015, : 213 - 219
  • [22] Scalable Full-Text Search for Petascale File Systems
    Leung, Andrew W.
    Miller, Ethan L.
    PDSW'08: PROCEEDINGS OF THE 2008 3RD PETASCALE DATA STORAGE WORKSHOP, 2008, : 16 - 22
  • [23] Efficient Indexing of Regional Maximum Activations of Convolutions using Full-Text Search Engines
    Amato, Giuseppe
    Carrara, Fabio
    Falchi, Fabrizio
    Gennaro, Claudio
    PROCEEDINGS OF THE 2017 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL (ICMR'17), 2017, : 425 - 428
  • [24] A method to improve full-text search performance of MongoDB
    Mesut, Altan
    Ozturk, Emir
    PAMUKKALE UNIVERSITY JOURNAL OF ENGINEERING SCIENCES-PAMUKKALE UNIVERSITESI MUHENDISLIK BILIMLERI DERGISI, 2022, 28 (05): : 720 - 729
  • [25] The design and implementation of computer full-text search engine
    Bu Zhi-jing
    Fan Yan
    Yang Jian-wen
    Cheng Lin
    2015 SEVENTH INTERNATIONAL CONFERENCE ON MEASURING TECHNOLOGY AND MECHATRONICS AUTOMATION (ICMTMA 2015), 2015, : 1163 - 1167
  • [26] TYPE-AHEAD EXPLORATORY SEARCH THROUGH TYPO AND WORD ORDER TOLERANT AUTOCOMPLETION
    Fafalios, Pavlos
    Tzitzikas, Yannis
    JOURNAL OF WEB ENGINEERING, 2015, 14 (1-2): : 80 - 116
  • [27] Full-text searching
    Olson, MA
    DR DOBBS JOURNAL, 1999, 24 (05): : 10 - 10
  • [28] THE FULL-TEXT IDEAL
    MARCUS, J
    DATABASE-THE MAGAZINE OF ELECTRONIC DATABASE REVIEWS, 1995, 18 (06): : 83 - 85
  • [29] Big Data Full-Text Search Index Minimization Using Text Summarization
    Iqbal, Waheed
    Malik, Waqas Ilyas
    Bukhari, Faisal
    Almustafa, Khaled Mohamad
    Nawaz, Zubiar
    INFORMATION TECHNOLOGY AND CONTROL, 2021, 50 (02): : 375 - 389
  • [30] ChemDB update - full-text search and virtual chemical space
    Chen, Jonathan H.
    Linstead, Erik
    Swamidass, S. Joshua
    Wang, Dennis
    Baldi, Pierre
    BIOINFORMATICS, 2007, 23 (17) : 2348 - 2351