BitFunnel: Revisiting Signatures for Search

Cited by: 22
Authors
Goodwin, Bob [1]
Hopcroft, Michael [1]
Luu, Dan [1]
Clemmer, Alex [2]
Curmei, Mihaela [1]
Elnikety, Sameh [1]
He, Yuxiong [1]
Affiliations
[1] Microsoft, Redmond, WA 98052 USA
[2] Heptio, Seattle, WA USA
Keywords
Signature Files; Search Engines; Inverted Indexes; Intersection; Bitvector; Bloom Filters; Bit-Sliced Signatures; Query Processing; Inverted Files
DOI
10.1145/3077136.3080789
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Since the mid-90s there has been a widely-held belief that signature files are inferior to inverted files for text indexing. In recent years the Bing search engine has developed and deployed an index based on bit-sliced signatures. This index, known as BitFunnel, replaced an existing production system based on an inverted index. The driving factor behind the shift away from the inverted index was operational cost savings. This paper describes algorithmic innovations and changes in the cloud computing landscape that led us to reconsider and eventually field a technology that was once considered unusable. The BitFunnel algorithm directly addresses four fundamental limitations in bit-sliced block signatures. At the same time, our mapping of the algorithm onto a cluster offers opportunities to avoid other costs associated with signatures. We show these innovations yield a significant efficiency gain versus classic bit-sliced signatures and then compare BitFunnel with Partitioned Elias-Fano Indexes, MG4J, and Lucene.
Pages: 605-614
Page count: 10