Beyond IID: Three Levels of Generalization for Question Answering on Knowledge Bases

被引:49
|
作者
Gu, Yu [1 ]
Kase, Sue [2 ]
Vanni, Michelle T. [2 ]
Sadler, Brian M. [2 ]
Liang, Percy [3 ]
Yan, Xifeng [4 ]
Su, Yu [1 ]
机构
[1] Ohio State Univ, Columbus, OH 43210 USA
[2] US Army Res Lab, Aberdeen Proving Ground, MD USA
[3] Stanford Univ, Stanford, CA 94305 USA
[4] Univ Calif Santa Barbara, Santa Barbara, CA 93106 USA
基金
美国国家科学基金会;
关键词
Knowledge Base; Question Answering; Semantic Parsing;
D O I
10.1145/3442381.3449992
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Existing studies on question answering on knowledge bases (KBQA) mainly operate with the standard i.i.d. assumption, i.e., training distribution over questions is the same as the test distribution. However, i.i.d. may be neither achievable nor desirable on large-scale KBs because 1) true user distribution is hard to capture and 2) randomly sampling training examples from the enormous space would be data-inefficient. Instead, we suggest that KBQA models should have three levels of built-in generalization: i.i.d., compositional, and zero-shot. To facilitate the development of KBQA models with stronger generalization, we construct and release a new large-scale, high-quality dataset with 64,331 questions, GRAILQA, and provide evaluation settings for all three levels of generalization. In addition, we propose a novel BERT-based KBQA model. The combination of our dataset and model enables us to thoroughly examine and demonstrate, for the first time, the key role of pre-trained contextual embeddings like BERT in the generalization of KBQA.(1)
引用
收藏
页码:3477 / 3488
页数:12
相关论文
共 50 条
  • [1] Question Answering over Knowledge Bases
    Liu, Kang
    Zhao, Jun
    He, Shizhu
    Zhang, Yuanzhe
    [J]. IEEE INTELLIGENT SYSTEMS, 2015, 30 (05) : 26 - 35
  • [2] Question Answering over Knowledge Bases
    Siciliani, Lucia
    [J]. SEMANTIC WEB: ESWC 2018 SATELLITE EVENTS, 2018, 11155 : 283 - 293
  • [3] Interpretable Question Answering on Knowledge Bases and Text
    Sydorova, Alona
    Poerner, Nina
    Roth, Benjamin
    [J]. 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, : 4943 - 4951
  • [4] Question Answering with Knowledge Base, Web and Beyond
    Yih, Wen-tau
    Ma, Hao
    [J]. SIGIR'16: PROCEEDINGS OF THE 39TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2016, : 1219 - 1221
  • [5] A Comparative Study of Question Answering over Knowledge Bases
    Khiem Vinh Tran
    Hao Phu Phan
    Khang Nguyen Duc Quach
    Ngan Luu-Thuy Nguyen
    Jo, Jun
    Thanh Tam Nguyen
    [J]. ADVANCED DATA MINING AND APPLICATIONS (ADMA 2022), PT I, 2022, 13725 : 259 - 274
  • [6] Systematic review of question answering over knowledge bases
    Pereira, Arnaldo
    Trifan, Alina
    Lopes, Rui Pedro
    Oliveira, Jose Luis
    [J]. IET SOFTWARE, 2022, 16 (01) : 1 - 13
  • [7] TEQUILA: Temporal Question Answering over Knowledge Bases
    Jia, Zhen
    Abujabal, Abdalghani
    Roy, Rishiraj Saha
    Stroetgen, Jannik
    Weikum, Gerhard
    [J]. CIKM'18: PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2018, : 1807 - 1810
  • [8] Gathering Knowledge for Question Answering Beyond Named Entities
    Przybyla, Piotr
    [J]. NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, NLDB 2015, 2015, 9103 : 412 - 417
  • [9] Beyond NED: Fast and Effective Search Space Reduction for Complex Question Answering over Knowledge Bases
    Christmann, Philipp
    Roy, Rishiraj Saha
    Weikum, Gerhard
    [J]. WSDM'22: PROCEEDINGS OF THE FIFTEENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2022, : 172 - 180
  • [10] Skeleton parsing for complex question answering over knowledge bases
    Sun, Yawei
    Li, Pengwei
    Cheng, Gong
    Qu, Yuzhong
    [J]. JOURNAL OF WEB SEMANTICS, 2022, 72