Bridging the Gap between Synthetic and Natural Questions via Sentence Decomposition for Semantic Parsing

被引:0
|
作者
Niu, Yilin [1 ,2 ,3 ]
Huang, Fei [1 ,2 ,3 ]
Liu, Wei [4 ]
Cui, Jianwei [4 ]
Wang, Bin [4 ]
Huang, Minlie [1 ,2 ,3 ]
机构
[1] Tsinghua Univ, CoAI Lab, DCST, Beijing, Peoples R China
[2] Inst Artificial Intelligence, State Key Lab Intelligent Technol & Syst, Beijing, Peoples R China
[3] Beijing Natl Res Ctr Informat Sci & Technol, Beijing, Peoples R China
[4] Xiaomi AI Lab, Beijing, Peoples R China
基金
美国国家科学基金会;
关键词
Computational linguistics - Knowledge based systems - Natural language processing systems - Syntactics - Zero-shot learning;
D O I
10.1162/tacl_a_00552
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Semantic parsing maps natural language questions into logical forms, which can be executed against a knowledge base for answers. In real-world applications, the performance of a parser is often limited by the lack of training data. To facilitate zero-shot learning, data synthesis has been widely studied to automatically generate paired questions and logical forms. However, data synthesis methods can hardly cover the diverse structures in natural languages, leading to a large gap in sentence structure between synthetic and natural questions. In this paper, we propose a decomposition-based method to unify the sentence structures of questions, which benefits the generalization to natural questions. Experiments demonstrate that our method significantly improves the semantic parser trained on synthetic data (+7.9% on KQA and +8.9% on ComplexWebQuestions in terms of exact match accuracy). Extensive analysis demonstrates that our method can better generalize to natural questions with novel text expressions compared with baselines. Besides semantic parsing, our idea potentially benefits other semantic understanding tasks by mitigating the distracting structure features. To illustrate this, we extend our method to the task of sentence embedding learning, and observe substantial improvements on sentence retrieval (+13.1% for Hit@1).
引用
收藏
页码:367 / 383
页数:17
相关论文
共 50 条
  • [1] Bridging the gap between semantic and pragmatic
    Mathieu, P
    Routier, JC
    Secq, Y
    [J]. IKE'03: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE ENGINEERING, VOLS 1 AND 2, 2003, : 308 - 312
  • [2] Bridging the Semantic Gap via Functional Brain Imaging
    Hu, Xintao
    Li, Kaiming
    Han, Junwei
    Hua, Xiansheng
    Guo, Lei
    Liu, Tianming
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2012, 14 (02) : 314 - 325
  • [3] Bridging the vocabulary gap between questions and answer sentences
    Momtazi, Saeedeh
    Klakow, Dietrich
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2015, 51 (05) : 595 - 615
  • [4] Bridging the Gap between Linked Data and the Semantic Desktop
    Groza, Tudor
    Dragan, Laura
    Handschuh, Siegfried
    Decker, Stefan
    [J]. SEMANTIC WEB - ISWC 2009, PROCEEDINGS, 2009, 5823 : 827 - +
  • [5] Bridging the Semantic Gap Between Image Contents and Tags
    Ma, Hao
    Zhu, Jianke
    Lyu, Michael Rung-Tsong
    King, Irwin
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2010, 12 (05) : 462 - 473
  • [6] Bridging the Gap Between Semantic Segmentation and Instance Segmentation
    Yin, Chengxiang
    Tang, Jian
    Yuan, Tongtong
    Xu, Zhiyuan
    Wang, Yanzhi
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 4183 - 4196
  • [7] BRIDGING THE GAP BETWEEN NATURAL AND ARTIFICIAL PHOTOSYNTHESIS
    NORRIS, JR
    GAST, P
    [J]. JOURNAL OF PHOTOCHEMISTRY, 1985, 29 (1-2): : 185 - 194
  • [8] Bridging the Gap Between Consumers' Medication Questions and Trusted Answers
    Ben Abacha, Asma
    Mrabet, Yassine
    Sharp, Mark
    Goodwin, Travis R.
    Shooshan, Sonya E.
    Demner-Fushman, Dina
    [J]. MEDINFO 2019: HEALTH AND WELLBEING E-NETWORKS FOR ALL, 2019, 264 : 25 - 29
  • [9] Bridging the gap between systems biology and synthetic biology
    Liu, Di
    Hoynes-O'Connor, Allison
    Zhang, Fuzhong
    [J]. FRONTIERS IN MICROBIOLOGY, 2013, 4
  • [10] Semantic Mapping between Natural Language Questions and SQL Queries via Syntactic Pairing
    Giordani, Alessandra
    Moschitti, Alessandro
    [J]. NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, 2010, 5723 : 207 - 221