BitFunnel: Revisiting Signatures for Search

被引:22
|
作者
Goodwin, Bob [1 ]
Hopcroft, Michael [1 ]
Luu, Dan [1 ]
Clemmer, Alex [2 ]
Curmei, Mihaela [1 ]
Elnikety, Sameh [1 ]
He, Yuxiong [1 ]
机构
[1] Microsoft, Redmond, WA 98052 USA
[2] Heptio, Seattle, WA USA
关键词
Signature Files; Search Engines; Inverted Indexes; Intersection; Bitvector; Bloom Filters; Bit-Sliced Signatures; Query Processing; INVERTED FILES;
D O I
10.1145/3077136.3080789
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Since the mid-90s there has been a widely-held belief that signature files are inferior to inverted files for text indexing. In recent years the Bing search engine has developed and deployed an index based on bit-sliced signatures. This index, known as BitFunnel, replaced an existing production system based on an inverted index. The driving factor behind the shift away from the inverted index was operational cost savings. This paper describes algorithmic innovations and changes in the cloud computing landscape that led us to reconsider and eventually field a technology that was once considered unusable. The BitFunnel algorithm directly addresses four fundamental limitations in bit-sliced block signatures. At the same time, our mapping of the algorithm onto a cluster offers opportunities to avoid other costs associated with signatures. We show these innovations yield a significant efficiency gain versus classic bit-sliced signatures and then compare BitFunnel with Partitioned Elias-Fano Indexes, MG4J, and Lucene.
引用
收藏
页码:605 / 614
页数:10
相关论文
共 50 条
  • [31] Revisiting "In Search of Excellence: A Portfolio Management Perspective"
    Bannister, Barry B.
    Cantor, Jesse B.
    JOURNAL OF INVESTING, 2013, 22 (03): : 21 - 27
  • [32] Revisiting Lexical Signatures to (Re-)Discover Web Pages
    Klein, Martin
    Nelson, Michael L.
    RESEARCH AND ADVANCED TECHNOLOGY FOR DIGITAL LIBRARIES, 2008, 5173 : 371 - 382
  • [33] Revisiting the variable memory model of visual search
    Horowitz, Todd S.
    VISUAL COGNITION, 2006, 14 (4-8) : 668 - 684
  • [34] Revisiting IR Techniques for Collaborative Search Strategies
    Joho, Hideo
    Hannah, David
    Jose, Joemon M.
    ADVANCES IN INFORMATION RETRIEVAL, PROCEEDINGS, 2009, 5478 : 66 - +
  • [35] Search for exospheric signatures from transiting planets
    Iro, N
    Coustenis, A
    Moutou, C
    Lajous, N
    Mayor, M
    Queloz, D
    Extrasolar Planets: Today and Tomorrow, 2004, 321 : 209 - 210
  • [36] The Rule Search Method for Association Signatures on NetDPI
    Liao, Ming-Yi
    Cheng, Tsung-Sheng
    Yang, Chu-Sing
    INTELLIGENT SYSTEMS AND APPLICATIONS (ICS 2014), 2015, 274 : 1426 - 1435
  • [37] Revisiting the category effect: The influence of meaning and search strategy on the efficiency of visual search
    Smilek, D
    Dixon, MJ
    Merikle, PM
    BRAIN RESEARCH, 2006, 1080 : 73 - 90
  • [38] Recent Progress in Search for Dark Sector Signatures
    Deliyergiyev, Maksym
    OPEN PHYSICS, 2016, 14 (01): : 281 - 303
  • [39] X-SEARCH: Revisiting Private Web Search using Intel SGX
    Ben Mokhtar, Sonia
    Boutet, Antoine
    Felber, Pascal
    Pasin, Marcelo
    Pires, Rafael
    Schiavoni, Valerio
    PROCEEDINGS OF THE 2017 INTERNATIONAL MIDDLEWARE CONFERENCE (MIDDLEWARE'17), 2017, : 198 - 208
  • [40] MAGIC and the search for signatures of supersymmetric dark matter
    Elsässer, D
    Mannheim, K
    NEW ASTRONOMY REVIEWS, 2005, 49 (2-6) : 297 - 301