HAS-QA: Hierarchical Answer Spans Model for Open-Domain Question Answering

被引:0
|
作者
Pang, Liang [1 ,2 ]
Lan, Yanyan [1 ,2 ,3 ]
Guo, Jiafeng [1 ,2 ]
Xu, Jun [1 ,2 ]
Su, Lixin [1 ,2 ]
Cheng, Xueqi [1 ,2 ]
机构
[1] Chinese Acad Sci, Inst Comp Technol, CAS Key Lab Network Data Sci & Technol, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Beijing, Peoples R China
[3] Univ Calif Berkeley, Dept Stat, Berkeley, CA 94720 USA
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper is concerned with open-domain question answering (i.e., OpenQA). Recently, some works have viewed this problem as a reading comprehension (RC) task, and directly applied successful RC models to it. However, the performances of such models are not so good as that in the RC task. In our opinion, the perspective of RC ignores three characteristics in OpenQA task: 1) many paragraphs without the answer span are included in the data collection; 2) multiple answer spans may exist within one given paragraph; 3) the end position of an answer span is dependent with the start position. In this paper, we first propose a new probabilistic formulation of OpenQA, based on a three-level hierarchical structure, i.e., the question level, the paragraph level and the answer span level. Then a Hierarchical Answer Spans Model (HAS-QA) is designed to capture each probability. HAS-QA has the ability to tackle the above three problems, and experiments on public OpenQA datasets show that it significantly outperforms traditional RC baselines and recent OpenQA baselines.
引用
收藏
页码:6875 / 6882
页数:8
相关论文
共 50 条
  • [1] Dense Hierarchical Retrieval for Open-Domain Question Answering
    Liu, Ye
    Hashimoto, Kazuma
    Zhou, Yingbo
    Yavuz, Semih
    Xiong, Caiming
    Yu, Philip S.
    [J]. FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 188 - 200
  • [2] What's in a Name? Answer Equivalence For Open-Domain Question Answering
    Si, Chenglei
    Zhao, Chen
    Boyd-Graber, Jordan
    [J]. 2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 9623 - 9629
  • [3] Ranking Paragraphs for Improving Answer Recall in Open-Domain Question Answering
    Lee, Jinhyuk
    Yun, Seongjun
    Kim, Hyunjae
    Ko, Miyoung
    Kang, Jaewoo
    [J]. 2018 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2018), 2018, : 565 - 569
  • [4] Advances in open-domain question answering
    Zhang, Zhi-Chang
    Zhang, Yu
    Liu, Ting
    Li, Sheng
    [J]. Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2009, 37 (05): : 1058 - 1069
  • [5] L2R-QA: An Open-Domain Question Answering Framework
    He, Tieke
    Li, Yu
    Zou, Zhipeng
    Wu, Qing
    [J]. INTELLIGENCE SCIENCE AND BIG DATA ENGINEERING: BIG DATA AND MACHINE LEARNING, PT II, 2019, 11936 : 151 - 162
  • [6] Question and Answer Test-Train Overlap in Open-Domain Question Answering Datasets
    Lewis, Patrick
    Stenetorp, Pontus
    Riedel, Sebastian
    [J]. 16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2021), 2021, : 1000 - 1008
  • [7] Advances in question classification for open-domain question answering
    School of Computer Science and Technology, Anhui University of Technology, Maanshan
    Anhui
    243002, China
    不详
    Jiangsu
    210023, China
    [J]. Tien Tzu Hsueh Pao, 8 (1627-1636):
  • [8] SF-QA: Simple and Fair Evaluation Library for Open-domain Question Answering
    Lu, Xiaopeng
    Lee, Kyusong
    Zhao, Tiancheng
    [J]. EACL 2021: THE 16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: PROCEEDINGS OF THE SYSTEM DEMONSTRATIONS, 2021, : 7 - 13
  • [9] Type checking in open-domain question answering
    Schlobach, S
    Olsthoorn, M
    de Rijke, M
    [J]. ECAI 2004: 16TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2004, 110 : 398 - 402
  • [10] Passage filtering for open-domain Question Answering
    Noguera, Elisa
    Llopis, Fernando
    Ferrandez, Antonio
    [J]. ADVANCES IN NATURAL LANGUAGE PROCESSING, PROCEEDINGS, 2006, 4139 : 534 - 540