BitFunnel: Revisiting Signatures for Search

被引:22
|
作者
Goodwin, Bob [1 ]
Hopcroft, Michael [1 ]
Luu, Dan [1 ]
Clemmer, Alex [2 ]
Curmei, Mihaela [1 ]
Elnikety, Sameh [1 ]
He, Yuxiong [1 ]
机构
[1] Microsoft, Redmond, WA 98052 USA
[2] Heptio, Seattle, WA USA
关键词
Signature Files; Search Engines; Inverted Indexes; Intersection; Bitvector; Bloom Filters; Bit-Sliced Signatures; Query Processing; INVERTED FILES;
D O I
10.1145/3077136.3080789
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Since the mid-90s there has been a widely-held belief that signature files are inferior to inverted files for text indexing. In recent years the Bing search engine has developed and deployed an index based on bit-sliced signatures. This index, known as BitFunnel, replaced an existing production system based on an inverted index. The driving factor behind the shift away from the inverted index was operational cost savings. This paper describes algorithmic innovations and changes in the cloud computing landscape that led us to reconsider and eventually field a technology that was once considered unusable. The BitFunnel algorithm directly addresses four fundamental limitations in bit-sliced block signatures. At the same time, our mapping of the algorithm onto a cluster offers opportunities to avoid other costs associated with signatures. We show these innovations yield a significant efficiency gain versus classic bit-sliced signatures and then compare BitFunnel with Partitioned Elias-Fano Indexes, MG4J, and Lucene.
引用
收藏
页码:605 / 614
页数:10
相关论文
共 50 条
  • [41] Efficient search of combinatorial maps using signatures
    Gosselin, Stephane
    Damiand, Guillaume
    Solnon, Christine
    THEORETICAL COMPUTER SCIENCE, 2011, 412 (15) : 1392 - 1405
  • [42] In search of definitive signatures of the elusive NCCO radical
    Simmonett, Andrew C.
    Evangelista, Francesco A.
    Allen, Wesley D.
    Schaefer, Henry F., III
    JOURNAL OF CHEMICAL PHYSICS, 2007, 127 (01):
  • [43] Search for signatures of sterile neutrinos with Double Chooz
    T. Abrahão
    H. Almazan
    J. C. dos Anjos
    S. Appel
    J. C. Barriere
    I. Bekman
    T. J. C. Bezerra
    L. Bezrukov
    E. Blucher
    T. Brugière
    C. Buck
    J. Busenitz
    A. Cabrera
    M. Cerrada
    E. Chauveau
    P. Chimenti
    O. Corpace
    J. V. Dawson
    Z. Djurcic
    A. Etenko
    H. Furuta
    I. Gil-Botella
    A. Givaudan
    H. Gomez
    L. F. G. Gonzalez
    M. C. Goodman
    T. Hara
    J. Haser
    D. Hellwig
    L. Heuermann
    A. Hourlier
    M. Ishitsuka
    J. Jochum
    C. Jollet
    K. Kale
    M. Kaneda
    M. Karakac
    T. Kawasaki
    E. Kemp
    H. de Kerret
    D. Kryn
    M. Kuze
    T. Lachenmaier
    C. E. Lane
    T. Lasserre
    C. Lastoria
    D. Lhuillier
    H. P. Lima Jr
    M. Lindner
    J. M. López-Casta no
    The European Physical Journal C, 2021, 81
  • [44] Search for signatures of sterile neutrinos with Double Chooz
    Almazan, H.
    dos Anjos, J. C.
    Appel, S.
    Barriere, J. C.
    Bekman, I
    Bezerra, T. J. C.
    Bezrukov, L.
    Blucher, E.
    Brugiere, T.
    Buck, C.
    Busenitz, J.
    Cabrera, A.
    Cerrada, M.
    Chauveau, E.
    Chimenti, P.
    Corpace, O.
    Dawson, J., V
    Djurcic, Z.
    Etenko, A.
    Furuta, H.
    Gil-Botella, I
    Givaudan, A.
    Gomez, H.
    Gonzalez, L. F. G.
    Goodman, M. C.
    Hara, T.
    Haser, J.
    Hellwig, D.
    Heuermann, L.
    Hourlier, A.
    Ishitsuka, M.
    Jochum, J.
    Jollet, C.
    Kale, K.
    Kaneda, M.
    Karakac, M.
    Kawasaki, T.
    Kemp, E.
    de Kerret, H.
    Kryn, D.
    Kuze, M.
    Lachenmaier, T.
    Lane, C. E.
    Lasserre, T.
    Lastoria, C.
    Lhuillier, D.
    Lima, H. P., Jr.
    Lindner, M.
    LoSecco, J. M.
    Lubsandorzhiev, B.
    EUROPEAN PHYSICAL JOURNAL C, 2021, 81 (08):
  • [45] TimeExplorer: Similarity Search Time Series by Their Signatures
    Tuan Nhon Dang
    Wilkinson, Leland
    ADVANCES IN VISUAL COMPUTING, ISVC 2013, PT I, 2013, 8033 : 280 - 289
  • [46] Applications in the search for genomic selection signatures in fish
    Lopez, Maria E.
    Neira, Roberto
    Yanez, Jose M.
    FRONTIERS IN GENETICS, 2015, 5
  • [47] SEARCH FOR STANDARD AND EXOTIC SUPERSYMMETRY SIGNATURES AT CMS
    Zalewski, Piotr
    ACTA PHYSICA POLONICA B, 2010, 41 (07): : 1647 - 1659
  • [48] A sensitive search for life signatures in the martian atmosphere
    Villanueva, Geronimo L.
    Mumma, Michael J.
    Novak, Robert E.
    Hewagama, Tilak
    Bonev, Boncho P.
    DiSanti, Michael A.
    ASTROBIOLOGY, 2007, 7 (03) : 497 - 497
  • [49] Revisiting Gilbert Strang's "A Chaotic Search for i"
    Li, Ao
    Corless, Robert M.
    ACM COMMUNICATIONS IN COMPUTER ALGEBRA, 2019, 53 (01): : 1 - 22
  • [50] Revisiting the Pushchino RRAT search using a neural network
    Tyul'bashev, S. A.
    Pervukhin, D., V
    Kitaeva, M. A.
    Tyul'basheva, G. E.
    Brylyakova, E. A.
    Chernosov, A., V
    ASTRONOMY & ASTROPHYSICS, 2022, 664