Neural Ranking with Weak Supervision for Open-Domain Question Answering : A Survey

被引：0

作者：

Shen, Xiaoyu ^{[1
]}

Vakulenko, Svitlana ^{[1
]}

del Tredici, Marco ^{[1
]}

Barlacchi, Gianni ^{[1
]}

Byrne, Bill ^{[1
,2
]}

de Gispert, Adria ^{[1
]}

机构：

[1] Amazon Alexa AI, Seattle, WA 98121 USA

[2] Univ Cambridge, Cambridge, England

来源：

17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023 | 2023年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Neural ranking (NR) has become a key component for open-domain question-answering in order to access external knowledge. However, training a good NR model requires substantial amounts of relevance annotations, which is very costly to scale. To address this, a growing body of research works have been proposed to reduce the annotation cost by training the NR model with weak supervision (WS) instead. These works differ in what resources they require and employ a diverse set of WS signals to train the model. Understanding such differences is crucial for choosing the right WS technique. To facilitate this understanding, we provide a structured overview of standard WS signals used for training a NR model. Based on their required resources, we divide them into three main categories: (1) only documents are needed; (2) documents and questions are needed; and (3) documents and question-answer pairs are needed. For every WS signal, we review its general idea and choices. Promising directions are outlined for future research.

引用

页码：1736 / 1750

页数：15

共 50 条

[1] Ranking and Sampling in Open-Domain Question Answering
Xu, Yanfu
Lin, Zheng
Liu, Yuanxin
Liu, Rui
Wang, Weiping
Meng, Dan
[J]. 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 2412 - 2421
[2] Ranking Paragraphs for Improving Answer Recall in Open-Domain Question Answering
Lee, Jinhyuk
Yun, Seongjun
Kim, Hyunjae
Ko, Miyoung
Kang, Jaewoo
[J]. 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 565 - 569
[3] Type checking in open-domain question answering
Schlobach, S
Olsthoorn, M
de Rijke, M
[J]. ECAI 2004: 16TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2004, 110 : 398 - 402
[4] Passage filtering for open-domain Question Answering
Noguera, Elisa
Llopis, Fernando
Ferrandez, Antonio
[J]. ADVANCES IN NATURAL LANGUAGE PROCESSING, PROCEEDINGS, 2006, 4139 : 534 - 540
[5] A Light Ranker for Open-Domain Question Answering
Qiu, Boyu
Xu, Jungang
Chen, Xu
Sun, Yingfei
[J]. 2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
[6] PyGaggle: A Gaggle of Resources for Open-Domain Question Answering
Pradeep, Ronak
Chen, Haonan
Gu, Lingwei
Tamber, Manveer Singh
Lin, Jimmy
[J]. ADVANCES IN INFORMATION RETRIEVAL, ECIR 2023, PT III, 2023, 13982 : 148 - 162
[7] Adaptive Information Seeking for Open-Domain Question Answering
Zhu, Yunchang
Pang, Liang
Lan, Yanyan
Shen, Huawei
Cheng, Xueqi
[J]. 2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 3615 - 3626
[8] RRQA: reconfirmed reader for open-domain question answering
Li, Shi
Zhang, Wenqian
[J]. APPLIED INTELLIGENCE, 2023, 53 (15) : 18420 - 18430
[9] ODSQA: OPEN-DOMAIN SPOKEN QUESTION ANSWERING DATASET
Lee, Chia-Hsuan
Wang, Shang-Ming
Chang, Huan-Cheng
Lee, Hung-Yi
[J]. 2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018), 2018, : 949 - 956
[10] Dense Hierarchical Retrieval for Open-Domain Question Answering
Liu, Ye
Hashimoto, Kazuma
Zhou, Yingbo
Yavuz, Semih
Xiong, Caiming
Yu, Philip S.
[J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 188 - 200

← 1 2 3 4 5 →