Learning Transferable Features for Open-Domain Question Answering

被引:0
|
作者
Zuin, Gianlucca [1 ]
Chaimowicz, Luiz [1 ]
Veloso, Adriano [1 ]
机构
[1] Univ Fed Minas Gerais, Dept Comp Sci, Belo Horizonte, MG, Brazil
基金
欧盟地平线“2020”;
关键词
Question-Answering; Transfer Learning; Deep Networks;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Corpora used to learn open-domain Question-Answering (QA) models are typically collected from a wide variety of topics or domains. Since QA requires understanding natural language, open-domain QA models generally need very large training corpora. A simple way to alleviate data demand is to restrict the domain covered by the QA model, leading thus to domain-specific QA models. While learning improved QA models for a specific domain is still challenging due to the lack of sufficient training data in the topic of interest, additional training data can be obtained from related topic domains. Thus, instead of learning a single open-domain QA model, we investigate domain adaptation approaches in order to create multiple improved domain-specific QA models. We demonstrate that this can be achieved by stratifying the source dataset, without the need of searching for complementary data unlike many other domain adaptation approaches. We propose a deep architecture that jointly exploits convolutional and recurrent networks for learning domain-specific features while transferring domain-shared features. That is, we use transferable features to enable model adaptation from multiple source domains. We consider different transference approaches designed to learn span-level and sentence-level QA models. We found that domain-adaptation greatly improves sentence-level QA performance, and span-level QA benefits from sentence information. Finally, we also show that a simple clustering algorithm may be employed when the topic domains are unknown and the resulting loss in accuracy is negligible.
引用
收藏
页数:8
相关论文
共 50 条
  • [41] Retrieve What You Need: A Mutual Learning Framework for Open-domain Question Answering
    Wang, Dingmin
    Huang, Qiuyuan
    Jackson, Matthew
    Gao, Jianfeng
    [J]. TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2024, 12 : 247 - 263
  • [42] Efficient Passage Retrieval with Hashing for Open-domain Question Answering
    Yamada, Ikuya
    Asai, Akari
    Hajishirzi, Hannaneh
    [J]. ACL-IJCNLP 2021: THE 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 2, 2021, : 979 - 986
  • [43] End-to-End Open-Domain Question Answering with BERTserini
    Yang, Wei
    Xie, Yuqing
    Lin, Aileen
    Li, Xingyu
    Tan, Luchen
    Xiong, Kun
    Li, Ming
    Lin, Jimmy
    [J]. NAACL HLT 2019: THE 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES: PROCEEDINGS OF THE DEMONSTRATIONS SESSION, 2019, : 72 - 77
  • [44] Generation-Augmented Retrieval for Open-Domain Question Answering
    Mao, Yuning
    He, Pengcheng
    Liu, Xiaodong
    Shen, Yelong
    Gao, Jianfeng
    Han, Jiawei
    Chen, Weizhu
    [J]. 59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (ACL-IJCNLP 2021), VOL 1, 2021, : 4089 - 4100
  • [45] Open-Domain Why-Question Answering with Adversarial Learning to Encode Answer Texts
    Oh, Jong-Hoon
    Kadowaki, Kazuma
    Kloetzer, Julien
    Iida, Ryu
    Torisawa, Kentaro
    [J]. 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 4227 - 4237
  • [46] To Adapt or to Annotate: Challenges and Interventions for Domain Adaptation in Open-Domain Question Answering
    Dua, Dheeru
    Strubell, Emma
    Singh, Sameer
    Verga, Pat
    [J]. PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023): LONG PAPERS, VOL 1, 2023, : 14429 - 14446
  • [47] Neural Ranking with Weak Supervision for Open-Domain Question Answering : A Survey
    Shen, Xiaoyu
    Vakulenko, Svitlana
    del Tredici, Marco
    Barlacchi, Gianni
    Byrne, Bill
    de Gispert, Adria
    [J]. 17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 1736 - 1750
  • [48] Evaluating Open-Domain Question Answering in the Era of Large Language Models
    Kamalloo, Ehsan
    Dziri, Nouha
    Clarke, Charles L. A.
    Rafiei, Davood
    [J]. PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 5591 - 5606
  • [49] Performance issues and error analysis in an open-domain question answering system
    Moldovan, D
    Pasca, M
    Harabagiu, S
    Surdeanu, M
    [J]. 40TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE, 2002, : 33 - 40
  • [50] A Copy-Augmented Generative Model for Open-Domain Question Answering
    Liu, Shuang
    Wang, Dong
    Li, Xiaoguang
    Huang, Minghui
    Ding, Meizhen
    [J]. PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022): (SHORT PAPERS), VOL 2, 2022, : 435 - 441