Retrieval Data Augmentation Informed by Downstream Question Answering Performance

被引:0
|
作者
Ferguson, James [1 ]
Dasigi, Pradeep [2 ]
Khot, Tushar [2 ]
Hajishirzi, Hannaneh [1 ,2 ]
机构
[1] Univ Washington, Seattle, WA 98195 USA
[2] Allen Inst AI, Seattle, WA USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Training retrieval models to fetch contexts for Question Answering (QA) over large corpora requires labeling relevant passages in those corpora. Since obtaining exhaustive manual annotations of all relevant passages is not feasible, prior work uses text overlap heuristics to find passages that are likely to contain the answer, but this is not feasible when the task requires deeper reasoning and answers are not extractable spans (e.g.: multi-hop, discrete reasoning). We address this issue by identifying relevant passages based on whether they are useful for a trained QA model to arrive at the correct answers, and develop a search process guided by the QA model's loss. Our experiments show that this approach enables identifying relevant context for unseen data greater than 90% of the time on the IIRC dataset and generalizes better to the end QA task than those trained on just the gold retrieval data on IIRC and QASC datasets.
引用
收藏
页码:1 / 5
页数:5
相关论文
共 50 条
  • [21] Question retrieval using combined queries in community question answering
    Saquib Khushhal
    Abdul Majid
    Syed Ali Abbas
    Malik Sajjad Ahmed Nadeem
    Saeed Arif Shah
    [J]. Journal of Intelligent Information Systems, 2020, 55 : 307 - 327
  • [22] Improving Question Retrieval in Community Question Answering with Label Ranking
    Wang, Wei
    Li, Baichuan
    King, Irwin
    [J]. 2011 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2011, : 349 - 356
  • [23] Manhattan Siamese LSTM for Question Retrieval in Community Question Answering
    Othman, Nouha
    Faiz, Rim
    Smaili, Kamel
    [J]. ON THE MOVE TO MEANINGFUL INTERNET SYSTEMS: OTM 2019 CONFERENCES, 2019, 11877 : 661 - 677
  • [24] Boosting Conversational Question Answering with Fine-Grained Retrieval-Augmentation and Self-Check
    Ye, Linhao
    Lei, Zhikai
    Yin, Jianghao
    Chen, Qin
    Zhou, Jie
    He, Liang
    [J]. PROCEEDINGS OF THE 47TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2024, 2024, : 2301 - 2305
  • [25] Document Retrieval Based on Question Answering System
    Nguyen Tuan Dang
    Do Thi Thanh Tuyen
    [J]. ICIC 2009: SECOND INTERNATIONAL CONFERENCE ON INFORMATION AND COMPUTING SCIENCE, VOL 1, PROCEEDINGS: COMPUTING SCIENCE AND ITS APPLICATION, 2009, : 183 - +
  • [26] Evaluating passage retrieval approaches for question answering
    Robert, I
    Gaizauskas, R
    [J]. ADVANCES IN INFORMATION RETRIEVAL, PROCEEDINGS, 2004, 2997 : 72 - 84
  • [27] Document Retrieval System for Biomedical Question Answering
    Bolat, Harun
    Sen, Baha
    [J]. APPLIED SCIENCES-BASEL, 2024, 14 (06):
  • [28] Swahili Information Retrieval: A Question - Answering Approach
    Telemala, Joseph P.
    Suleman, Hussein
    [J]. PROCEEDINGS OF THE ANNUAL CONFERENCE OF THE SOUTH AFRICAN INSTITUTE OF COMPUTER SCIENTISTS AND INFORMATION TECHNOLOGISTS (SAICSIT 2018), 2018, : 345 - 345
  • [29] Yahoo! Answers for Sentence Retrieval in Question Answering
    Momtazi, Saeedeh
    Klakow, Dietrich
    [J]. LREC 2010 - SEVENTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2010, : D28 - D35
  • [30] Language independent passage retrieval for question answering
    Gómez-Soriano, JM
    Montes-y-Gómez, M
    Sanchis-Arnal, E
    Villaseñor-Pineda, L
    Rosso, P
    [J]. MICAI 2005: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2005, 3789 : 816 - 823