Efficient fuzzy full-text type-ahead search

被引:0
|
作者
Guoliang Li
Shengyue Ji
Chen Li
Jianhua Feng
机构
[1] Tsinghua University,Department of Computer Science
[2] University of California,Department of Computer Science
来源
The VLDB Journal | 2011年 / 20卷
关键词
Auto complete; Full-text search; Type-ahead search; Fuzzy search;
D O I
暂无
中图分类号
学科分类号
摘要
Traditional information systems return answers after a user submits a complete query. Users often feel “left in the dark” when they have limited knowledge about the underlying data and have to use a try-and-see approach for finding information. A recent trend of supporting autocomplete in these systems is a first step toward solving this problem. In this paper, we study a new information-access paradigm, called “type-ahead search” in which the system searches the underlying data “on the fly” as the user types in query keywords. It extends autocomplete interfaces by allowing keywords to appear at different places in the underlying data. This framework allows users to explore data as they type, even in the presence of minor errors. We study research challenges in this framework for large amounts of data. Since each keystroke of the user could invoke a query on the backend, we need efficient algorithms to process each query within milliseconds. We develop various incremental-search algorithms for both single-keyword queries and multi-keyword queries, using previously computed and cached results in order to achieve a high interactive speed. We develop novel techniques to support fuzzy search by allowing mismatches between query keywords and answers. We have deployed several real prototypes using these techniques. One of them has been deployed to support type-ahead search on the UC Irvine people directory, which has been used regularly and well received by users due to its friendly interface and high efficiency.
引用
收藏
页码:617 / 640
页数:23
相关论文
共 50 条
  • [31] IMPROVING FULL-TEXT SEARCH PERFORMANCE THROUGH TEXTUAL ANALYSIS
    MOLTO, M
    INFORMATION PROCESSING & MANAGEMENT, 1993, 29 (05) : 615 - 632
  • [32] FULL-TEXT DATABASES
    SIDDIQUI, MA
    ONLINE REVIEW, 1991, 15 (06): : 367 - 372
  • [33] Full-text Search for Verifiable Credential Metadata on Distributed Ledgers
    Lux, Zoltan Andras
    Beierle, Felix
    Zickau, Sebastian
    Goendoer, Sebastian
    2019 SIXTH INTERNATIONAL CONFERENCE ON INTERNET OF THINGS: SYSTEMS, MANAGEMENT AND SECURITY (IOTSMS), 2019, : 519 - 528
  • [34] RepoVis: Visual Overviews and Full-Text Search in Software Repositories
    Feiner, Johannes
    Andrews, Keith
    2018 SIXTH IEEE WORKING CONFERENCE ON SOFTWARE VISUALIZATION (VISSOFT), 2018, : 1 - 11
  • [35] FULL-TEXT DATABASES
    TENOPIR, C
    ANNUAL REVIEW OF INFORMATION SCIENCE AND TECHNOLOGY, 1984, 19 : 215 - 246
  • [36] Humanities full-text
    Williams, H
    LIBRARY JOURNAL, 2003, 128 (05) : 124 - 124
  • [37] HAPS: Supporting Effective and Efficient Full-Text P2P Search with Peer Dynamics
    Zu-Jie Ren
    Ke Chen
    Li-Dan Shou
    Gang Chen
    Yi-Jun Bei
    Xiao-Yan Li
    Journal of Computer Science and Technology, 2010, 25 : 482 - 498
  • [38] HAPS: Supporting Effective and Efficient Full-Text P2P Search with Peer Dynamics
    Ren, Zu-Jie
    Chen, Ke
    Shou, Li-Dan
    Chen, Gang
    Bei, Yi-Jun
    Li, Xiao-Yan
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2010, 25 (03) : 482 - 498
  • [39] Efficient evaluation of distance predicates in XPath full-text query
    Chen, H
    Wang, XL
    Zhou, AY
    ADVANCED WEB AND NETWORK TECHNOLOGIES, AND APPLICATIONS, PROCEEDINGS, 2006, 3842 : 76 - 85
  • [40] Proposal of a lightweight, offline, full-text search engine for an mHealth app
    Lopes, Carla Teixeira
    Azevedo, David
    Monteiro, Joao M.
    2022 17TH IBERIAN CONFERENCE ON INFORMATION SYSTEMS AND TECHNOLOGIES (CISTI), 2022,