StepGame: A New Benchmark for Robust Multi-Hop Spatial Reasoning in Texts

被引:0
|
作者
Shi, Zhengxiang [1 ]
Zhang, Qiang [2 ]
Lipani, Aldo [1 ]
机构
[1] UCL, London, England
[2] Zhejiang Univ, Hangzhou, Zhejiang, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Inferring spatial relations in natural language is a crucial ability an intelligent system should possess. The bAbI dataset tries to capture tasks relevant to this domain (task 17 and 19). However, these tasks have several limitations. Most importantly, they are limited to fixed expressions, they are limited in the number of reasoning steps required to solve them, and they fail to test the robustness of models to input that contains irrelevant or redundant information. In this paper, we present a new Question-Answering dataset called StepGame for robust multi-hop spatial reasoning in texts. Our experiments demonstrate that state-of-the-art models on the bAbI dataset struggle on the StepGame dataset. Moreover, we propose a Tensor-Product based Memory-Augmented Neural Network (TP-MANN) specialized for spatial reasoning tasks. Experimental results on both datasets show that our model outperforms all the baselines with superior generalization and robustness performance.
引用
收藏
页码:11321 / 11329
页数:9
相关论文
共 50 条
  • [31] Language Generation with Multi-Hop Reasoning on Commonsense Knowledge Graph
    Ji, Haozhe
    Ke, Pei
    Huang, Shaohan
    Wei, Furu
    Zhu, Xiaoyan
    Huang, Minlie
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 725 - 736
  • [32] Breadth First Reasoning Graph for Multi-hop Question Answering
    Huang, Yongjie
    Yang, Meng
    2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 5810 - 5821
  • [33] Joint reasoning-based embedded multi-hop KGQA
    Xu, Tongzhao
    Tohti, Turdi
    Liang, Yi
    Zuo, Zicheng
    Hamdulla, Askar
    Journal of Intelligent and Fuzzy Systems, 2024, 47 (5-6): : 457 - 469
  • [34] Text Reasoning Chain Extraction for Multi-Hop Question Answering
    Wang, Pengming
    Zhu, Zijiang
    Chen, Qing
    Dai, Weihuang
    TSINGHUA SCIENCE AND TECHNOLOGY, 2024, 29 (04): : 959 - 970
  • [35] Deep Inductive Logic Reasoning for Multi-Hop Reading Comprehension
    Wang, Wenya
    Pan, Sinno Jialin
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 4999 - 5009
  • [36] Robust multi-hop time synchronization in sensor networks
    Maroti, M
    Kusy, B
    Simon, G
    Ledeczi, A
    ICWN'04 & PCC'04, VOLS, 1 AND 2, PROCEEDINGS, 2004, : 454 - 460
  • [37] Multi-view Semantic Reasoning Networks for Multi-hop Question Answering
    Long X.
    Zhao R.
    Sun J.
    Ju S.
    Gongcheng Kexue Yu Jishu/Advanced Engineering Sciences, 2023, 55 (02): : 285 - 297
  • [38] Cooperative spatial multiplexing in multi-hop wireless networks
    Zhang, Yimin
    Wang, Genyuan
    Amin, Moeness G.
    2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 4491 - 4494
  • [39] Optimal Spatial Reuse in Poisson Multi-hop Networks
    Stamatiou, Kostas
    Haenggi, Martin
    2010 IEEE GLOBAL TELECOMMUNICATIONS CONFERENCE GLOBECOM 2010, 2010,
  • [40] Multi-hop Reading Comprehension Incorporating Sentence-Based Reasoning
    Huo, Lijun
    Ge, Bin
    Zhao, Xiang
    WEB AND BIG DATA, PT I, APWEB-WAIM 2020, 2020, 12317 : 544 - 557