Real-Time Open-Domain Question Answering with Dense-Sparse Phrase Index

被引:0
|
作者
Seo, Minjoon [1 ,5 ]
Lee, Jinhyuk [6 ]
Kwiatkowski, Tom [2 ]
Parikh, Ankur P. [2 ]
Farhadi, Ali [1 ,3 ,4 ]
Hajishirzi, Hannaneh [1 ,3 ]
机构
[1] Univ Washington, Seattle, WA 98195 USA
[2] Google, Mountain View, CA USA
[3] Allen Inst AI, Seattle, WA USA
[4] XNOR AI, Seattle, WA USA
[5] NAVER, Seongnam, South Korea
[6] Korea Univ, Seoul, South Korea
基金
新加坡国家研究基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Existing open-domain question answering (QA) models are not suitable for real-time usage because they need to process several long documents on-demand for every input query, which is computationally prohibitive. In this paper, we introduce query-agnostic indexable representations of document phrases that can drastically speed up open-domain QA. In particular, our dense-sparse phrase encoding effectively captures syntactic, semantic, and lexical information of the phrases and eliminates the pipeline filtering of context documents. Leveraging strategies for optimizing training and inference time, our model can be trained and deployed even in a single 4-GPU server. Moreover, by representing phrases as pointers to their start and end tokens, our model indexes phrases in the entire English Wikipedia (up to 60 billion phrases) using under 2TB. Our experiments on SQuAD-Open show that our model is on par with or more accurate than previous models with 6000x reduced computational cost, which translates into at least 68x faster end-to-end inference benchmark on CPUs. Code and demo are available at nlp.cs.washington.edu/denspi
引用
收藏
页码:4430 / 4441
页数:12
相关论文
共 50 条
  • [1] Dynamic Dense-Sparse Representations for Real-Time Question Answering
    Sun, Minyu
    Jiang, Bin
    Yang, Chao
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 1445 - 1450
  • [2] Contextualized Sparse Representations for Real-Time Open-Domain Question Answering
    Lee, Jinhyuk
    Seo, Minjoon
    Hajishirzi, Hannaneh
    Kang, Jaewoo
    58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), 2020, : 912 - 919
  • [3] Progressively Pretrained Dense Corpus Index for Open-Domain Question Answering
    Xiong, Wenhan
    Wang, Hong
    Wang, William Yang
    16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2021), 2021, : 2803 - 2815
  • [4] Dense Hierarchical Retrieval for Open-Domain Question Answering
    Liu, Ye
    Hashimoto, Kazuma
    Zhou, Yingbo
    Yavuz, Semih
    Xiong, Caiming
    Yu, Philip S.
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 188 - 200
  • [5] Dense Passage Retrieval for Open-Domain Question Answering
    Karpukhin, Vladimir
    Oguz, Barlas
    Min, Sewon
    Lewis, Patrick
    Wu, Ledell
    Edunov, Sergey
    Chen, Danqi
    Yih, Wen Tau
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 6769 - 6781
  • [6] Dense-sparse discretization for optimization and real-time guidance
    Kumar, RR
    Seywald, H
    JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 1996, 19 (02) : 501 - 503
  • [7] Advances in open-domain question answering
    Zhang, Zhi-Chang
    Zhang, Yu
    Liu, Ting
    Li, Sheng
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2009, 37 (05): : 1058 - 1069
  • [8] Advances in question classification for open-domain question answering
    School of Computer Science and Technology, Anhui University of Technology, Maanshan
    Anhui
    243002, China
    不详
    Jiangsu
    210023, China
    Tien Tzu Hsueh Pao, 8 (1627-1636):
  • [9] Contrastive Refinement for Dense Retrieval Inference in the Open-Domain Question Answering Task
    Zhai, Qiuhong
    Zhu, Wenhao
    Zhang, Xiaoyu
    Liu, Chenyun
    FUTURE INTERNET, 2023, 15 (04):
  • [10] Type checking in open-domain question answering
    Schlobach, S
    Olsthoorn, M
    de Rijke, M
    ECAI 2004: 16TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2004, 110 : 398 - 402