Operation-Augmented Numerical Reasoning for Question Answering

被引:1
|
作者
Zhou, Yongwei [1 ]
Bao, Junwei [2 ]
Wu, Youzheng [2 ]
He, Xiaodong [2 ]
Zhao, Tiejun [1 ]
机构
[1] Harbin Inst Technol, Sch Comp Sci & Technol, Machine Intelligence & Translat Lab, Harbin 150001, Peoples R China
[2] JD AI Res, Beijing 101111, Peoples R China
基金
中国国家自然科学基金;
关键词
Cognition; Task analysis; Semantics; Speech processing; Sorting; Question answering (information retrieval); Predictive models; Numerical reasoning; symbolic operations; semantic augmentation; mixture-of-experts;
D O I
10.1109/TASLP.2023.3316448
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Question answering requiring numerical reasoning, which generally involves symbolic operations such as sorting, counting, and addition, is a challenging task. To address such a problem, existing mixture-of-experts (MoE)-based methods design several specific answer predictors to handle different types of questions and achieve promising performance. However, they ignore the modeling and exploitation of fine-grained reasoning-related operations to support numerical reasoning, encountering the inadequacy in reasoning capability and interpretability. To alleviate this issue, we propose OPERA, an operation-augmented numerical reasoning framework. Concretely, we systematically define a scalable operation set to model numerical reasoning. We first identify reasoning-related operations based on context and then softly execute them to imitate the answer reasoning procedure via an operation-aware cross-attention mechanism. Finally, we utilize the operation-augmented semantic representation of execution results to support answer prediction. We verify the effectiveness and generalization of OPERA in two scenarios with different knowledge sources and reasoning capabilities. Specifically, we conduct extensive experiments on two textual datasets, DROP and RACENum, and a table-text hybrid dataset TAT-QA. Experiment results show that OPERA outperforms previous strong methods on the DROP, RACENum, and TAT-QA datasets. Further, we statistically and visually analyze its interpretability.
引用
收藏
页码:15 / 28
页数:14
相关论文
共 50 条
  • [41] Graph Reasoning Transformers for Knowledge -Aware Question Answering
    Zhao, Ruilin
    Zhao, Feng
    Hu, Liang
    Xu, Guandong
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 17, 2024, : 19652 - 19660
  • [42] Question Answering as Global Reasoning over Semantic Abstractions
    Khashabi, Daniel
    Khot, Tushar
    Sabharwal, Ashish
    Roth, Dan
    THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 1905 - 1914
  • [43] Reasoning with Heterogeneous Graph Alignment for Video Question Answering
    Jiang, Pin
    Han, Yahong
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 11109 - 11116
  • [44] PRIOR VISUAL RELATIONSHIP REASONING FOR VISUAL QUESTION ANSWERING
    Yang, Zhuoqian
    Qin, Zengchang
    Yu, Jing
    Wan, Tao
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 1411 - 1415
  • [45] Combining Natural Logic and Shallow Reasoning for Question Answering
    Angeli, Gabor
    Nayak, Neha
    Manning, Christopher D.
    PROCEEDINGS OF THE 54TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, 2016, : 442 - 452
  • [46] Analogical Reasoning for Answer Ranking in Social Question Answering
    Tu, Xudong
    Feng, Dan
    Wang, Xin-Jing
    Zhang, Lei
    IEEE INTELLIGENT SYSTEMS, 2012, 27 (05) : 28 - 35
  • [47] PathReasoner: Explainable reasoning paths for commonsense question answering
    Zhan, Xunlin
    Huang, Yinya
    Dong, Xiao
    Cao, Qingxing
    Liang, Xiaodan
    Knowledge-Based Systems, 2022, 235
  • [48] Video Question Answering with Spatio-Temporal Reasoning
    Yunseok Jang
    Yale Song
    Chris Dongjoo Kim
    Youngjae Yu
    Youngjin Kim
    Gunhee Kim
    International Journal of Computer Vision, 2019, 127 : 1385 - 1412
  • [49] SPARTQA: A Textual Question Answering Benchmark for Spatial Reasoning
    Mirzaee, Roshanak
    Faghihi, Hossein Rajaby
    Ning, Qiang
    Kordjamshidi, Parisa
    2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 4582 - 4598
  • [50] Visual Question Answering with Memory-Augmented Networks
    Ma, Chao
    Shen, Chunhua
    Dick, Anthony
    Wu, Qi
    Wang, Peng
    van den Hengel, Anton
    Reid, Ian
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 6975 - 6984