Towards Analysis and Interpretation of Large Language Models for Arithmetic Reasoning

Cited by: 0
Authors
Akter, Mst Shapna [1]
Shahriar, Hossain [2]
Cuzzocrea, Alfredo [3, 4]
Affiliations
[1] Univ West Florida, Dept Intelligent Syst & Robot, Pensacola, FL 32514 USA
[2] Univ West Florida, Ctr Cybersecur, Pensacola, FL USA
[3] Univ Calabria, iDEA Lab, Arcavacata Di Rende, Italy
[4] Univ Paris City, Dept Comp Sci, Paris, France
Keywords
LLMs; Arithmetic Reasoning; Causal Mediation Analysis
DOI
10.1109/SDS60720.2024.00049
Chinese Library Classification number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Large Language Models (LLMs) have recently come to dominate the research scene, with particular regard to the Transformer architecture in the context of arithmetic reasoning. Against this background, this paper lays the groundwork for a causal mediation analysis of how Transformer-based LLMs approach complex arithmetic problems. In particular, we try to identify which model components, such as activations, are crucial for complex reasoning tasks. Our preliminary results suggest that, for complex arithmetic operations, information is channeled from mid-layer activations to the final token through enhanced attention mechanisms. Preliminary experiments are reported.
Pages: 267-270
Page count: 4