StepGame: A New Benchmark for Robust Multi-Hop Spatial Reasoning in Texts

被引:0
|
作者
Shi, Zhengxiang [1 ]
Zhang, Qiang [2 ]
Lipani, Aldo [1 ]
机构
[1] UCL, London, England
[2] Zhejiang Univ, Hangzhou, Zhejiang, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Inferring spatial relations in natural language is a crucial ability an intelligent system should possess. The bAbI dataset tries to capture tasks relevant to this domain (task 17 and 19). However, these tasks have several limitations. Most importantly, they are limited to fixed expressions, they are limited in the number of reasoning steps required to solve them, and they fail to test the robustness of models to input that contains irrelevant or redundant information. In this paper, we present a new Question-Answering dataset called StepGame for robust multi-hop spatial reasoning in texts. Our experiments demonstrate that state-of-the-art models on the bAbI dataset struggle on the StepGame dataset. Moreover, we propose a Tensor-Product based Memory-Augmented Neural Network (TP-MANN) specialized for spatial reasoning tasks. Experimental results on both datasets show that our model outperforms all the baselines with superior generalization and robustness performance.
引用
收藏
页码:11321 / 11329
页数:9
相关论文
共 50 条
  • [21] HOP, UNION, GENERATE: Explainable Multi-hop Reasoning without Rationale Supervision
    Zhao, Wenting
    Chiu, Justin T.
    Cardie, Claire
    Rush, Alexander M.
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2023), 2023, : 16119 - 16130
  • [22] To hop or not, that is the question: Towards effective multi-hop reasoning over knowledge graphs
    Liao, Jinzhi
    Zhao, Xiang
    Tang, Jiuyang
    Zeng, Weixin
    Tan, Zhen
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2021, 24 (05): : 1837 - 1856
  • [23] A New Concept of Knowledge based Question Answering (KBQA) System for Multi-hop Reasoning
    Wang, Yu
    Srinivasan, Vijay
    Jin, Hongxia
    NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022, : 4007 - 4017
  • [24] Exploiting Reasoning Chains for Multi-hop Science Question Answering
    Xu, Weiwen
    Deng, Yang
    Zhang, Huihui
    Cai, Deng
    Lam, Wai
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 1143 - 1156
  • [25] Rethinking Offensive Text Detection as a Multi-Hop Reasoning Problem
    Zhang, Qiang
    Naradowsky, Jason
    Miyao, Yusuke
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), 2022, : 3888 - 3905
  • [26] A Multi-Hop Reasoning Knowledge Selection Module for Dialogue Generation
    Ma, Zhiqiang
    Liu, Jia
    Xu, Biqi
    Lv, Kai
    Guo, Siyuan
    ELECTRONICS, 2024, 13 (16)
  • [27] GMH: A General Multi-hop Reasoning Model for KG Completion
    Zhang, Yao
    Liang, Hongru
    Jatowt, Adam
    Lei, Wenqiang
    Wei, Xin
    Jiang, Ning
    Yang, Zhenglu
    2021 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2021), 2021, : 3437 - 3446
  • [28] ExKGR: Explainable Multi-hop Reasoning for Evolving Knowledge Graph
    Yan, Cheng
    Zhao, Feng
    Jin, Hai
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2022, PT I, 2022, : 153 - 161
  • [29] Attention-based Multi-hop Reasoning for Knowledge Graph
    Wang, Zikang
    Li, Linjing
    Zeng, Daniel Dajun
    Chen, Yue
    2018 IEEE INTERNATIONAL CONFERENCE ON INTELLIGENCE AND SECURITY INFORMATICS (ISI), 2018, : 211 - 213
  • [30] Explainable Multi-hop Verbal Reasoning Through Internal Monologue
    Liang, Zhengzhong
    Bethard, Steven
    Surdeanu, Mihai
    2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 1225 - 1250