Generation-Augmented Retrieval for Open-Domain Question Answering

被引:0
|
作者
Mao, Yuning [1 ]
He, Pengcheng [2 ]
Liu, Xiaodong [3 ]
Shen, Yelong [2 ]
Gao, Jianfeng [3 ]
Han, Jiawei [1 ]
Chen, Weizhu [2 ]
机构
[1] Univ Illinois, Urbana, IL 61801 USA
[2] Microsoft Azure AI, Seattle, WA USA
[3] Microsoft Res, Redmond, WA USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose Generation-Augmented Retrieval (GAR) for answering open-domain questions, which augments a query through text generation of heuristically discovered relevant contexts without external resources as supervision. We demonstrate that the generated contexts substantially enrich the semantics of the queries and GAR with sparse representations (BM25) achieves comparable or better performance than state-of-the-art dense retrieval methods such as DPR (Karpukhin et al., 2020). We show that generating diverse contexts for a query is beneficial as fusing their results consistently yields better retrieval accuracy. Moreover, as sparse and dense representations are often complementary, GAR can be easily combined with DPR to achieve even better performance. GAR achieves state-of-the-art performance on Natural Questions and TriviaQA datasets under the extractive QA setup when equipped with an extractive reader, and consistently outperforms other retrieval methods when the same generative reader is used.
引用
收藏
页码:4089 / 4100
页数:12
相关论文
共 50 条
  • [1] Dense Hierarchical Retrieval for Open-Domain Question Answering
    Liu, Ye
    Hashimoto, Kazuma
    Zhou, Yingbo
    Yavuz, Semih
    Xiong, Caiming
    Yu, Philip S.
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 188 - 200
  • [2] Dense Passage Retrieval for Open-Domain Question Answering
    Karpukhin, Vladimir
    Oguz, Barlas
    Min, Sewon
    Lewis, Patrick
    Wu, Ledell
    Edunov, Sergey
    Chen, Danqi
    Yih, Wen Tau
    [J]. PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 6769 - 6781
  • [3] LI-RAGE: Late Interaction Retrieval Augmented Generation with Explicit Signals for Open-Domain Table Question Answering
    Lin, Weizhe
    Blloshmi, Rexhina
    Byrne, Bill
    de Gispert, Adria
    Iglesias, Gonzalo
    [J]. 61ST CONFERENCE OF THE THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 2, 2023, : 1557 - 1566
  • [4] Efficient Passage Retrieval with Hashing for Open-domain Question Answering
    Yamada, Ikuya
    Asai, Akari
    Hajishirzi, Hannaneh
    [J]. ACL-IJCNLP 2021: THE 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 2, 2021, : 979 - 986
  • [5] A Copy-Augmented Generative Model for Open-Domain Question Answering
    Liu, Shuang
    Wang, Dong
    Li, Xiaoguang
    Huang, Minghui
    Ding, Meizhen
    [J]. PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022): (SHORT PAPERS), VOL 2, 2022, : 435 - 441
  • [6] Multi-Hop Paragraph Retrieval for Open-Domain Question Answering
    Feldman, Yair
    El-Yaniv, Ran
    [J]. 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 2296 - 2309
  • [7] Improving the Domain Adaptation of Retrieval Augmented Generation (RAG) Models for Open Domain Question Answering
    Siriwardhana, Shamane
    Weerasekera, Rivindu
    Wen, Elliott
    Kaluarachchi, Tharindu
    Rana, Rajib
    Nanayakkara, Suranga
    [J]. TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2023, 11 : 1 - 17
  • [8] Type checking in open-domain question answering
    Schlobach, S
    Olsthoorn, M
    de Rijke, M
    [J]. ECAI 2004: 16TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2004, 110 : 398 - 402
  • [9] Contrastive Refinement for Dense Retrieval Inference in the Open-Domain Question Answering Task
    Zhai, Qiuhong
    Zhu, Wenhao
    Zhang, Xiaoyu
    Liu, Chenyun
    [J]. FUTURE INTERNET, 2023, 15 (04):
  • [10] An Efficient Document Retrieval for Korean Open-Domain Question Answering Based on ColBERT
    Kang, Byungha
    Kim, Yeonghwa
    Shin, Youhyun
    Mourtzis, Dimitris
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (24):