Neural Ranking with Weak Supervision for Open-Domain Question Answering: A Survey

Authors
Shen, Xiaoyu [1]
Vakulenko, Svitlana [1]
del Tredici, Marco [1]
Barlacchi, Gianni [1]
Byrne, Bill [1, 2]
de Gispert, Adria [1]
Affiliations
[1] Amazon Alexa AI, Seattle, WA 98121 USA
[2] Univ Cambridge, Cambridge, England
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Neural ranking (NR) has become a key component of open-domain question answering, where it is used to access external knowledge. However, training a good NR model requires substantial amounts of relevance annotations, which are costly to obtain at scale. To address this, a growing body of research reduces the annotation cost by training the NR model with weak supervision (WS) instead. These works differ in the resources they require and employ a diverse set of WS signals to train the model. Understanding such differences is crucial for choosing the right WS technique. To facilitate this understanding, we provide a structured overview of standard WS signals used for training an NR model. Based on their required resources, we divide them into three main categories: (1) only documents are needed; (2) documents and questions are needed; and (3) documents and question-answer pairs are needed. For every WS signal, we review its general idea and choices. Promising directions are outlined for future research.
Pages: 1736-1750
Number of pages: 15