Efficient Document-at-a-time and Score-at-a-time Query Evaluation for Learned Sparse Representations

被引:9
|
作者
MacKenzie, Joel [1 ]
Trotman, Andrew [2 ]
Lin, Jimmy [3 ]
机构
[1] Univ Queensland, St Lucia, Qld, Australia
[2] Univ Otago, Dept Comp Sci, POB 56, Dunedin, New Zealand
[3] Univ Waterloo, 200 Univ Ave West, Waterloo, ON N2L 3G1, Canada
基金
澳大利亚研究理事会; 加拿大自然科学与工程研究理事会;
关键词
Efficiency; indexing; query processing; learned sparse retrieval; STRATEGIES;
D O I
10.1145/3576922
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Researchers have had much recent success with ranking models based on so-called learned sparse representations generated by transformers. One crucial advantage of this approach is that such models can exploit inverted indexes for top-k retrieval, thereby leveraging decades of work on efficient query evaluation. Yet, there remain many open questions about how these learned representations fit within the existing literature, which our work aims to tackle using four representative learned sparse models. We find that impact weights generated by transformers appear to greatly reduce opportunities for skipping and early exiting optimizations in well-studied document-at-a-time (DAAT) approaches. Similarly, "off-the-shelf" application of score-at-a-time (SAAT) processing exhibits a mismatch between these weights and assumptions behind accumulator management strategies. Building on these observations, we present solutions to address deficiencies with both DAAT and SAAT approaches, yielding substantial speedups in query evaluation. Our detailed empirical analysis demonstrates that both methods lie on the effectiveness-efficiency Pareto frontier, indicating that the optimal choice for deployment depends on operational constraints.
引用
下载
收藏
页数:28
相关论文
共 50 条
  • [41] Dynamic Dense-Sparse Representations for Real-Time Question Answering
    Sun, Minyu
    Jiang, Bin
    Yang, Chao
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 1445 - 1450
  • [42] AttentionAttractor: efficient video stream similarity query processing in real time
    Assent, Ira
    Krieger, Ralph
    Seidl, Thomas
    2007 IEEE 23RD INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2007, : 1484 - +
  • [43] Efficient Time Series Classification via Sparse Linear Combination
    Zhang, Zhenguo
    Nie, Peng
    Wen, Yanlong
    2017 14TH WEB INFORMATION SYSTEMS AND APPLICATIONS CONFERENCE (WISA 2017), 2017, : 13 - 18
  • [44] Development of an Efficient Algorithm for computation of Time Varying Sparse Signal
    Shafi, Sulthana
    Aprem, T. J.
    Joseph, George M.
    PROCEEDINGS OF 2016 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE (ICIS), 2016, : 72 - 76
  • [45] Constructing Compact Time Series Index for Efficient Window Query Processing
    Zhao, Jing
    Wang, Peng
    Tang, Bo
    Liu, Lu
    Wang, Chen
    Wang, Wei
    Wang, Jianmin
    2022 IEEE 38TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2022), 2022, : 3025 - 3037
  • [46] Towards Efficient Query Processing on Massive Time-Evolving Graphs
    Fard, Arash
    Abdolrashidi, Amir
    Ramaswamy, Lakshmish
    Miller, John A.
    PROCEEDINGS OF THE 2012 8TH INTERNATIONAL CONFERENCE ON COLLABORATIVE COMPUTING: NETWORKING, APPLICATIONS AND WORKSHARING (COLLABORATECOM 2012), 2012, : 567 - 574
  • [47] Poster: Time-Efficient Sparse and Lightweight Adaptation for Real-Time Mobile Applications
    Cha, Hyeongheon
    Gong, Taesik
    Lee, Sung-Ju
    PROCEEDINGS OF THE 2024 THE 22ND ANNUAL INTERNATIONAL CONFERENCE ON MOBILE SYSTEMS, APPLICATIONS AND SERVICES, MOBISYS 2024, 2024, : 728 - 729
  • [48] Top-k query evaluation in sensor networks under query response time constraint
    Liang, Weifa
    Chen, Baichen
    Yu, Jeffrey Xu
    INFORMATION SCIENCES, 2011, 181 (04) : 869 - 882
  • [49] JoLT: Jointly Learned Representations of Language and Time-Series for Clinical Time-Series Interpretation (Student Abstract)
    Cai, Yifu
    Srinivasan, Arvind
    Goswami, Mononito
    Choudhry, Arjun
    Dubrawski, Artur
    THIRTY-EIGTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 21, 2024, : 23447 - 23448
  • [50] A Robust and Efficient Sparse Time-Invariant Radon Transform in the Mixed Time-Frequency Domain
    Wang, Benfeng
    Zhang, Yingqiang
    Lu, Wenkai
    Geng, Jianhua
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2019, 57 (10): : 7558 - 7566