Detecting Frozen Phrases in Open-Domain Question Answering

被引:2
|
作者
Yadegari, Mostafa [1 ]
Kamalloo, Ehsan [1 ]
Rafiei, Davood [1 ]
机构
[1] Univ Alberta, Edmonton, AB, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Open-domain question answering; Information retrieval; Frozen phrases; Sparse retriever; Question paraphrasing;
D O I
10.1145/3477495.3531793
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
There is essential information in the underlying structure of words and phrases in natural language questions, and this structure has been extensively studied. In this paper, we study one particular structure, referred to as frozen phrases, that is highly expected to transfer as a whole from questions to answer passages. Frozen phrases, if detected, can be helpful in open-domain Question Answering (QA) where identifying the localized context of a given input question is crucial. An interesting question is if frozen phrases can be accurately detected. We cast the problem as a sequence-labeling task and create synthetic data from existing QA datasets to train a model. We further plug this model into a sparse retriever that is made aware of the detected phrases. Our experiments reveal that detecting frozen phrases whose presence in answer documents are highly plausible yields significant improvements in retrievals as well as in the end-to-end accuracy of open-domain QA models.
引用
收藏
页码:1990 / 1996
页数:7
相关论文
共 50 条
  • [41] Efficient Passage Retrieval with Hashing for Open-domain Question Answering
    Yamada, Ikuya
    Asai, Akari
    Hajishirzi, Hannaneh
    [J]. ACL-IJCNLP 2021: THE 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 2, 2021, : 979 - 986
  • [42] Generation-Augmented Retrieval for Open-Domain Question Answering
    Mao, Yuning
    He, Pengcheng
    Liu, Xiaodong
    Shen, Yelong
    Gao, Jianfeng
    Han, Jiawei
    Chen, Weizhu
    [J]. 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (ACL-IJCNLP 2021), VOL 1, 2021, : 4089 - 4100
  • [43] End-to-End Open-Domain Question Answering with BERTserini
    Yang, Wei
    Xie, Yuqing
    Lin, Aileen
    Li, Xingyu
    Tan, Luchen
    Xiong, Kun
    Li, Ming
    Lin, Jimmy
    [J]. NAACL HLT 2019: THE 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES: PROCEEDINGS OF THE DEMONSTRATIONS SESSION, 2019, : 72 - 77
  • [44] To Adapt or to Annotate: Challenges and Interventions for Domain Adaptation in Open-Domain Question Answering
    Dua, Dheeru
    Strubell, Emma
    Singh, Sameer
    Verga, Pat
    [J]. PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023): LONG PAPERS, VOL 1, 2023, : 14429 - 14446
  • [45] Neural Ranking with Weak Supervision for Open-Domain Question Answering : A Survey
    Shen, Xiaoyu
    Vakulenko, Svitlana
    del Tredici, Marco
    Barlacchi, Gianni
    Byrne, Bill
    de Gispert, Adria
    [J]. 17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 1736 - 1750
  • [46] Evaluating Open-Domain Question Answering in the Era of Large Language Models
    Kamalloo, Ehsan
    Dziri, Nouha
    Clarke, Charles L. A.
    Rafiei, Davood
    [J]. PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 5591 - 5606
  • [47] Performance issues and error analysis in an open-domain question answering system
    Moldovan, D
    Pasca, M
    Harabagiu, S
    Surdeanu, M
    [J]. 40TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE, 2002, : 33 - 40
  • [48] DuReadervis: A Chinese Dataset for Open-domain Document Visual Question Answering
    Qi, Le
    Lv, Shangwen
    Li, Hongyu
    Liu, Jing
    Zhang, Yu
    She, Qiaoqiao
    Wu, Hua
    Wang, Haifeng
    Liu, Ting
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), 2022, : 1338 - 1351
  • [49] Performance issues and error analysis in an open-domain Question Answering system
    Moldovan, D
    Pasca, M
    Harabagiu, S
    Surdeanu, M
    [J]. ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2003, 21 (02) : 133 - 154
  • [50] A Copy-Augmented Generative Model for Open-Domain Question Answering
    Liu, Shuang
    Wang, Dong
    Li, Xiaoguang
    Huang, Minghui
    Ding, Meizhen
    [J]. PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022): (SHORT PAPERS), VOL 2, 2022, : 435 - 441