Learning Transferable Features for Open-Domain Question Answering

Citations: 0
Authors
Zuin, Gianlucca [1]
Chaimowicz, Luiz [1]
Veloso, Adriano [1]
Affiliations
[1] Univ Fed Minas Gerais, Dept Comp Sci, Belo Horizonte, MG, Brazil
Funding
EU Horizon 2020
Keywords
Question-Answering; Transfer Learning; Deep Networks
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Corpora used to learn open-domain Question-Answering (QA) models are typically collected from a wide variety of topics or domains. Since QA requires understanding natural language, open-domain QA models generally need very large training corpora. A simple way to reduce this data demand is to restrict the domain covered by the QA model, thus leading to domain-specific QA models. Learning improved QA models for a specific domain is still challenging because training data on the topic of interest is scarce, but additional training data can be obtained from related topic domains. Thus, instead of learning a single open-domain QA model, we investigate domain adaptation approaches in order to create multiple improved domain-specific QA models. We demonstrate that this can be achieved by stratifying the source dataset, without the need to search for complementary data, unlike many other domain adaptation approaches. We propose a deep architecture that jointly exploits convolutional and recurrent networks for learning domain-specific features while transferring domain-shared features. That is, we use transferable features to enable model adaptation from multiple source domains. We consider different transfer approaches designed to learn span-level and sentence-level QA models. We found that domain adaptation greatly improves sentence-level QA performance, and that span-level QA benefits from sentence information. Finally, we also show that a simple clustering algorithm may be employed when the topic domains are unknown, and that the resulting loss in accuracy is negligible.
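The stratification step mentioned in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not say which clustering algorithm or features were used, so the bag-of-words vectors, the cosine-similarity k-means, the seeding with the first k passages, and all names (`cluster_domains`, `stratify`, `passages`) are assumptions made purely for illustration. The idea shown is only that passages without topic labels can be grouped into pseudo-domains, and the source dataset can then be split by those groups.

```python
from collections import Counter, defaultdict
import math


def bag_of_words(text):
    """Lowercased token counts; a stand-in for whatever features a real system learns."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def mean_vector(vectors):
    """Component-wise mean of a non-empty list of sparse vectors."""
    total = Counter()
    for v in vectors:
        total.update(v)
    return Counter({t: c / len(vectors) for t, c in total.items()})


def cluster_domains(texts, k, iterations=10):
    """Plain k-means under cosine similarity, seeded with the first k texts (assumption)."""
    vectors = [bag_of_words(t) for t in texts]
    centroids = vectors[:k]
    labels = [0] * len(vectors)
    for _ in range(iterations):
        # Assign each text to its most similar centroid.
        labels = [max(range(k), key=lambda c: cosine(v, centroids[c])) for v in vectors]
        # Recompute each centroid from its current members.
        for c in range(k):
            members = [v for v, lab in zip(vectors, labels) if lab == c]
            if members:
                centroids[c] = mean_vector(members)
    return labels


def stratify(examples, labels):
    """Split a dataset into per-domain strata keyed by cluster label."""
    strata = defaultdict(list)
    for example, label in zip(examples, labels):
        strata[label].append(example)
    return dict(strata)


# Toy passages with no topic labels (hypothetical data).
passages = [
    "the football team won the match",
    "the enzyme binds the protein in the cell",
    "the striker scored in the football match",
    "the protein folds inside the cell",
]
labels = cluster_domains(passages, k=2)
strata = stratify(passages, labels)
```

Each stratum could then serve as the training set for one domain-specific model, with the remaining strata acting as related source domains. A real system would likely replace the bag-of-words vectors with learned embeddings.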
Pages: 8