Exploring Synergies between Causal Models and Large Language Models for Enhanced Understanding and Inference

被引:0
|
作者
Sun, Yaru [1 ]
Yang, Ying [1 ]
Fu, Wenhao [1 ]
机构
[1] Third Res Inst, Minist Publ Secur, Shanghai, Peoples R China
关键词
Large Language Models; Causal Models; Causal Knowledge Infusion; Causal Perception;
D O I
10.1145/3663976.3664023
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Large language models (LLMs) have sparked a new wave of excitement in the field of artificial intelligence, thanks to their robust generative capabilities. However, they fall short when it comes to comprehending factual knowledge and logical reasoning. In contrast, causal models demonstrate superior interpretability, resilience against disturbances, and decision-support capabilities. By integrating event generation mechanisms and external knowledge, causal models can enhance the reasoning and interpretability of LLMs. Nevertheless, the complex construction and iterative nature of causal models pose challenges that push the boundaries of current frameworks. Thus, leveraging the strengths of both LLMs and causal models can effectively address the limitations of LLMs in logical reasoning, complex inference, and causal deduction, as well as tackle the complexities and difficulties encountered in the establishment and analysis of causal models. These challenges include distinguishing between relevance and causality, handling reverse relationships, and managing interactions. This paper proposes a technical roadmap for a collaborative approach between LLMs and causal models, exploring four different methods of collaboration: causal relationship modeling, causal knowledge injection, causal perception, and causal relationship constraints. We review and summarize existing work while identifying future research directions in harnessing the synergy between LLMs and causal models.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] The Importance of Understanding Language in Large Language Models
    Youssef, Alaa
    Stein, Samantha
    Clapp, Justin
    Magnus, David
    [J]. AMERICAN JOURNAL OF BIOETHICS, 2023, 23 (10): : 6 - 7
  • [2] Causal Dataset Discovery with Large Language Models
    Liu, Junfei
    Sun, Shaotong
    Nargesian, Fatemeh
    [J]. WORKSHOP ON HUMAN-IN-THE-LOOP DATA ANALYTICS, HILDA 2024, 2024,
  • [3] Assessing Inference Time in Large Language Models
    Walkowiak, Bartosz
    Walkowiak, Tomasz
    [J]. SYSTEM DEPENDABILITY-THEORY AND APPLICATIONS, DEPCOS-RELCOMEX 2024, 2024, 1026 : 296 - 305
  • [4] Graphical models, causal inference, and econometric models
    Spirtes, Peter
    [J]. JOURNAL OF ECONOMIC METHODOLOGY, 2005, 12 (01) : 3 - 34
  • [5] Understanding Telecom Language Through Large Language Models
    Bariah, Lina
    Zou, Hang
    Zhao, Qiyang
    Mouhouche, Belkacem
    Bader, Faouzi
    Debbah, Merouane
    [J]. IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 6542 - 6547
  • [6] The RL/LLM Taxonomy Tree: Reviewing Synergies between Reinforcement Learning and Large Language Models
    Pternea, Moschoula
    Singh, Prerna
    Chakraborty, Abir
    Oruganti, Yagna
    Milletari, Mirco
    Bapat, Sayli
    Jiang, Kebei
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2024, 80 : 1525 - 1573
  • [7] Exploring Large Language Models for Classical Philology
    Riemenschneider, Frederick
    Frank, Anette
    [J]. PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023): LONG PAPERS, VOL 1, 2023, : 15181 - 15199
  • [8] Combining large language models with enterprise knowledge graphs: a perspective on enhanced natural language understanding
    Mariotti, Luca
    Guidetti, Veronica
    Mandreoli, Federica
    Belli, Andrea
    Lombardi, Paolo
    [J]. FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2024, 7
  • [9] The RL/LLM Taxonomy Tree: Reviewing Synergies between Reinforcement Learning and Large Language Models
    Pternea, Moschoula
    Singh, Prerna
    Chakraborty, Abir
    Oruganti, Yagna
    Milletari, Mirco
    Bapat, Sayli
    Jiang, Kebei
    [J]. Journal of Artificial Intelligence Research, 2024, 80 : 1525 - 1573
  • [10] Understanding, Uncovering, and Mitigating the Causes of Inference Slowdown for Language Models
    Varma, Kamala
    Numanoglu, Arda
    Kaya, Yigitcan
    Dumitras, Tudor
    [J]. IEEE CONFERENCE ON SAFE AND TRUSTWORTHY MACHINE LEARNING, SATML 2024, 2024, : 723 - 740